ser lys asn thr
val asp val asp 
asp glu glu asp 
asp glu gly glu 
asn met thr ala 
pro pro ile asn 
arg ala gly ser 
phe lys ala gly 
leu asp arg asn 
ile tyr lys gly 
ser ala val leu 
ala ala gly gly 
thr ala arg lys 
ggtctgtgtg tgcgtgcgtg cgagtgagtg agtgtgtgca tatttttttt tctcttttct
ttctctctct tttttttttt tttgcaaaga aacagcagcg ccgccgccgc tccgccgagg
cgctgcgccc cccggggggg ggaggcggag gaggcgggca gcggcggagg gaggggagcc
ggggaggggg gcgccgcgct gggagggagg cagcgcgcac ggtgcagccg ggccgggcgg
gaggc atg gcg ggg ccc ccg gcc cta ccc ccg ccg gag acg gcg gcg gcc
Met Ala Gly Pro Pro Ala Leu Pro Pro Pro Glu Thr Ala Ala Ala
1 5 10 15



gcc acc acg gcc gcg gcc gcc gcc tcg tcg tcc gcc gct tcc ccg cac
Ala Thr Thr Ala Ala Ala Ala Ala Ser Ser Ser Ala Ala Ser Pro His
20 25 30

tac caa gag tgg att ctg gac acc atc gac tcg ctg cgc tcg cgc aag
Tyr Gln Glu Trp Ile Leu Asp Thr Ile Asp Ser Leu Arg Ser Arg Lys
35 40 45

gcg cgg ccg gac ctg gag cgc atc tgc cgg atg gtg cgg cgg cgg cac
Ala Arg Pro Asp Leu Glu Arg Ile Cys Arg Met Val Arg Arg Arg His
50 55 60

ggc ccg gag ccg gag cgc acg cgc gcc gag ctc gag aaa ctg atc cag
Gly Pro Glu Pro Glu Arg Thr Arg Ala Glu Leu Glu Lys Leu Ile Gln
65 70 75

cag cgc gcc gtg ctc cgg gtc agc tac aag ggg agc atc tcg tac cgc
Gln Arg Ala Val Leu Arg Val Ser Tyr Lys Gly Ser Ile Ser Tyr Arg
80 85 90 95

aac gcg gcg cgc gtc cag ccg ccc cgg cgc gga gcc acc ccg ccg gcc
Asn Ala Ala Arg Val Gln Pro Pro Arg Arg Gly Ala Thr Pro Pro Ala
100 105 110

ccg ccg cgc gcc ccc cgc ggg ggc ccc gcc gcc gcc gcc gcg ccg ccg 
Pro Pro Arg Ala Pro Arg Gly Gly Pro Ala Ala Ala Ala Ala Pro Pro 
115 120 125 

ccc acg ccc gcc ccg ccg ccg ccg ccc gcg ccc gtc gcc gcc gcc gcc 
Pro Thr Pro Ala Pro Pro Pro Pro Pro Ala Pro Val Ala Ala Ala Ala 
130 135 140 

gcc ccg gcc cgg gcg ccc cgc gcg gcc gcc gcc gcc gct gcc gcc aca
Ala Pro Ala Arg Ala Pro Arg Ala Ala Ala Ala Ala Ala Ala Ala Thr
145 150 155

gcg ccc ccc tcg ccc ggc ccc gcg cag ccg ggc ccc cgc gcg cag cgg
Ala Pro Pro Ser Pro Gly Pro Ala Gln Pro Gly Pro Arg Ala Gln Arg
160 165 170 175

gcc gcg ccc ctg gcc gcg ccg ccg ccc gcg ccc gcc gct ccc ccg gcg
Ala Ala Pro Leu Ala Ala Pro Pro Pro Ala Pro Ala Ala Pro Pro Ala
180 185 190

FIG. 2A-1

gcg gcg ccc ccg gcc ggc ccg cgc cgc gcc ccc ccg ccc gcc gcc gcc 866
Ala Ala Pro Pro Ala Gly Pro Arg Arg Ala Pro Pro Pro Ala Ala Ala
195 200 205

gtc gcc gcc cgg gag tcg ccg ctg ccg ccg ccg cca cag ccg ccg gcg
Val Ala Ala Arg Glu Ser Pro Leu Pro Pro Pro Pro Gln Pro Pro Ala
210 215 220

ccg cca cag cag cag cag cag ccg ccg ccg cca ccg ccg ccg cag cag 
Pro Pro Gln Gln Gln Gln Gln Pro Pro Pro Pro Pro Pro Pro Gln Gln
225 230 235

cca cag ccg ccg ccg gag ggg ggc gcg gcg cgg gcc ggc ggc ccg gcg
Pro Gln Pro Pro Pro Glu Gly Gly Ala Ala Arg Ala Gly Gly Pro Ala
240 245 250 255

cgg ccc gtg agc ctg cgg gaa gtc gtg cgc tac ctc ggg ggt agc agc
Arg Pro Val Ser Leu Arg Glu Val Val Arg Tyr Leu Gly Gly Ser Ser
260 265 270

ggc gct ggc ggc cgc ctg acc cgc ggc cgc gtg cag ggt ctg ctg gaa
Gly Ala Gly Gly Arg Leu Thr Arg Gly Arg Val Gln Gly Leu Leu Glu
275 280 285

gag gag gcg gcg gcg cgg ggc cgc ctg gag cgc acc cgt ctc gga gcg 1154
Glu Glu Ala Ala Ala Arg Gly Arg Leu Glu Arg Thr Arg Leu Gly Ala
290 295 300

ctt gcg ctg ccc cgc ggg gac agg ccc gga cgg gcg cca ccg gcc gcc
Leu Ala Leu Pro Arg Gly Asp Arg Pro Gly Arg Ala Pro Pro Ala Ala
305 310 315

agc gcc cgc gcg gcg cgg aac aag aga gct ggc gag gag cga gtg ctt
Ser Ala Arg Ala Ala Arg Asn Lys Arg Ala Gly Glu Glu Arg Val Leu
320 325 330 335

gaa aag gag gag gag gag gag gag gag gaa gac gac gag gac gac gac 1298 
Glu Lys Glu Glu Glu Glu Glu Glu Glu Glu Asp Asp Glu Asp Asp Asp 
340 345 350 

gac gac gtc gtg tcc gag ggc tcg gag gtg ccc gag agc gat cgt ccc 1346
Asp Asp Val Val Ser Glu Gly Ser Glu Val Pro Glu Ser Asp Arg Pro
355 360 365

gcg ggt gcg cag cat cac cag ctg aat ggc ggc gag cgc ggc ccg cag 1394
Ala Gly Ala Gln His His Gln Leu Asn Gly Gly Glu Arg Gly Pro Gln
370 375 380 



acc gcc aag gag cgg gcc aag gag tgg tcg ctg tgt ggc ccc cac cct 1442
Thr Ala Lys Glu Arg Ala Lys Glu Trp Ser Leu Cys Gly Pro His Pro
385 390 395

ggc cag gag gaa ggg cgg ggg ccg gcc gcg ggc agt ggc acc cgc cag 1490
Gly Gln Glu Glu Gly Arg Gly Pro Ala Ala Gly Ser Gly Thr Arg Gln
400 405 410 415



FIG. 2A-2 
gtg ttc tcc atg gcg gcc ttg agt aag gag ggg gga tca gcc tct tcg
Val Phe Ser Met Ala Ala Leu Ser Lys Glu Gly Gly Ser Ala Ser Ser
420 425 430

acc acc ggg cct gac tcc ccg tcc ccg gtg cct ttg ccc ccc ggg aag
Thr Thr Gly Pro Asp Ser Pro Ser Pro Val Pro Leu Pro Pro Gly Lys
435 440 445

cca gcc ctc cca gga gcc gat ggg acc ccc ttt ggc tgc cct gcc ggg
Pro Ala Leu Pro Gly Ala Asp Gly Thr Pro Phe Gly Cys Pro Ala Gly
450 455 460

cgc aaa gag aag ccg gca gac ccc gtg gag tgg aca gtc atg gac gtc
Arg Lys Glu Lys Pro Ala Asp Pro Val Glu Trp Thr Val Met Asp Val
465 470 475

gtg gag tac ttc acc gag gcg ggc ttc cct gag caa gcc acg gct ttc
Val Glu Tyr Phe Thr Glu Ala Gly Phe Pro Glu Gln Ala Thr Ala Phe
480 485 490 495

cag gag cag gag atc gac ggc aag tcc ctg ctg ctc atg cag cgc acc
Gln Glu Gln Glu Ile Asp Gly Lys Ser Leu Leu Leu Met Gln Arg Thr
500 505 510

gat gtc ctc acc ggc ctg tcc atc cgc ctg ggg cca gcg ttg aaa atc
Asp Val Leu Thr Gly Leu Ser Ile Arg Leu Gly Pro Ala Leu Lys Ile
515 520 525

tat gag cac cat atc aag gtg ctg cag cag ggt cac ttc gag gac gat
Tyr Glu His His Ile Lys Val Leu Gln Gln Gly His Phe Glu Asp Asp
530 535 540

gac ccg gaa ggc ttc ctg gga t gagcacagag ccgccgcgcc ccttgtcccc
Asp Pro Glu Gly Phe Leu Gly
acccccaccc cgcctggacc cattcctgcc tccatgtcac ccaaggtgtc ccagaggcca
ggagctggac tgggcaggcg aggggtgcgg acctaccctg attctggtag ggggcggggc
cttactgtgc tcattgctac ccccccaccc cgtgtgtgtc tctgcacctg cccccagcac
acccctcccg gagcctggat gtcgcctggg actctggcct gctcattttg cccccagatc
agccccctcc ctccctcctg tcccaggaca ttttttaaaa gaaaaaaagg aaaaaaaaaa 
attggggagg gggctgggaa ggtgccccaa gatcctcctc ggcccaacca ggtgtttatt 
cctatatata tatatatatg ttttgttctg cctgtttttc gttttttggt gcgtggcctt 
tcttccctcc caccaccact catggcccca gccctgctcg ccctgtcggc gggagcagct 
gggaatggga ggagggtggg accttgggtc tgtctcccac cctctctccc gttggttctg
ttgtcgctcc agctggctgt attgcttttt aatattgcac cgaagggttg tttttttttt 2526
tttaaataaa attttaaaaa aaggaaaaaa aaaaa 

FIG. 2A-3 
asn asn arg Xaa pro lys
gly gly pro ala arg pro 
val arg tyr leu gly gly 
arg leu thr arg gly arg 
glu glu ala ala ala arg 
arg leu gly ala leu ala 
pro gly arg ala pro pro 
ala arg asn lys arg ala 
glu lys glu glu glu glu 
glu asp asp asp asp asp 
glu val pro glu ser asp 
his his gln leu asn gly
thr ala lys glu arg ala 
gly pro his pro gly gln
ala ala gly ser gly thr 
ala ala leu ser lys glu 
thr thr gly pro asp ser 
pro pro gly lys pro ala 
thr pro phe gly cys pro 
pro ala asp pro val glu 
val glu tyr phe thr glu 
ala thr ala phe gln glu
ser leu leu leu met gln
gly leu ser ile arg leu 
tyr glu his his ile lys 
phe glu asp asp asp pro 

FIG. 2B 
ala ser ala arg ala
gly glu glu arg val leu
glu glu glu glu asp asp
val val ser glu gly ser
arg pro ala gly ala gln
gly glu arg gly pro gln
lys glu trp ser leu cys
glu glu gly arg gly pro
arg gln val phe ser met
gly gly ser ala ser ser
pro ser pro val pro leu
leu pro gly ala asp gly
ala gly arg lys glu lys
trp thr val met asp val
ala gly phe pro glu gln
gln glu ile asp gly lys
arg thr asp val leu thr
gly pro ala leu lys ile
val leu gln gln gly his
glu gly phe leu gly 



ala arg asn lys arg ala 
glu lys glu glu glu glu 
glu asp asp asp asp asp 
glu val pro glu ser asp 
his his gln leu asn gly
thr ala lys glu arg ala 
gly pro his pro gly gln
ala ala gly ser gly thr 
ala ala leu ser lys glu 
thr thr gly pro asp ser 
pro pro gly lys pro ala 
thr pro phe gly cys pro 
pro ala asp pro val glu 
val glu tyr phe thr glu 
ala thr ala phe gln glu
ser leu leu leu met gln
gly leu ser ile arg leu 
tyr glu his his ile lys 
phe glu asp asp asp pro 

FIG. 3 
met lys asn gln

asp lys lys asn gly ala ala lys gln pro asn pro
lys ser ser pro gly gln pro glu ala gly ala glu
gly ala gln gly arg pro gly arg pro ala pro ala
arg glu ala glu gly ala ser ser gln ala pro gly
arg pro glu gly ala gln ala lys thr ala gln pro
gly ala leu cys asp val ser glu glu leu ser arg
gln leu glu asp ile leu ser thr tyr cys val asp
asn asn gln gly ala pro gly glu asp gly val gln
gly glu pro pro glu pro glu asp ala glu lys ser
arg ala tyr val ala arg asn gly glu pro glu pro
gly thr pro val val asn gly glu lys glu thr ser
lys ala glu pro gly thr glu glu ile arg thr ser
asp glu val gly asp arg asp his arg arg pro gln
glu lys lys lys ala lys gly leu gly lys glu ile
thr leu leu met gln thr leu asn thr leu ser thr
pro glu glu lys leu ala ala leu cys lys lys tyr
ala glu leu leu glu glu his arg asn ser gln lys
gln met lys leu leu gln lys lys gln ser gln leu
val gln glu lys asp his leu arg gly glu his ser
lys ala ile leu ala arg ser lys leu glu ser leu
cys arg glu leu gln arg his asn arg ser leu lys
glu glu gly val gln arg ala arg glu glu glu glu
lys arg lys glu val thr ser his phe gln met thr
leu asn asp ile gln leu gln met glu gln his asn
glu arg asn ser lys leu arg gln glu asn met glu

FIG. 5A 
leu ala glu arg leu lys lys leu ile glu gln tyr
glu leu arg glu glu his ile asp lys val phe lys
his lys asp leu gln gln gln leu val asp ala lys
leu gln gln ala gln glu met leu lys glu ala glu
glu arg his gln arg glu lys asp phe leu leu lys
glu ala val glu ser gln arg met cys glu leu met
lys gln gln glu thr his leu lys gln gln leu ala
leu tyr thr glu lys phe glu glu phe gln asn thr
leu ser lys ser ser glu val phe thr thr phe lys
gln glu met glu lys met thr lys lys ile lys lys
leu glu lys glu thr thr met tyr arg ser arg trp
glu ser ser asn lys ala leu leu glu met ala glu
glu lys thr leu arg asp lys glu leu glu gly leu
gln val lys ile gln arg leu glu lys leu cys arg
ala leu gln thr glu arg asn asp leu asn lys arg
val gln asp leu ser ala gly gly gln gly pro val
ser asp ser gly pro glu arg arg pro glu pro ala
thr thr ser lys glu gln gly val glu gly pro gly
ala gln val pro asn ser pro arg ala thr asp ala
ser cys cys ala gly ala pro ser thr glu ala ser
gly gln thr gly pro gln glu pro thr thr ala thr
ala 

FIG. 5B 
atg gcg ggg ccc ccg gcc cta ccc ccg ccg gag acg gcg gcg gcc gcc
Met Ala Gly Pro Pro Ala Leu Pro Pro Pro Glu Thr Ala Ala Ala Ala
1 5 10 15

acc acg gcg gcc gcc gcc tcg tcg tcc gcc gct tcc ccg cac tac caa
Thr Thr Ala Ala Ala Ala Ser Ser Ser Ala Ala Ser Pro His Tyr Gln
20 25 30

gag tgg atc ctg gac acc atc gac tcg ctg cgc tcg cgc aag gcg cgg
Glu Trp Ile Leu Asp Thr Ile Asp Ser Leu Arg Ser Arg Lys Ala Arg
35 40 45

ccg gac ctg gag cgc atc tgc cgg atg gtg cgg cgg cgg cac ggc ccg
Pro Asp Leu Glu Arg Ile Cys Arg Met Val Arg Arg Arg His Gly Pro
50 55 60

gag ccg gag cgc acg cgc gcc gag ctc gag aaa ctg atc cag cag cgc
Glu Pro Glu Arg Thr Arg Ala Glu Leu Glu Lys Leu Ile Gln Gln Arg
65 70 75 80

gcc gtg ctc cgg gtc agc tac aag ggg agc atc tcg tac cgc aac gcg
Ala Val Leu Arg Val Ser Tyr Lys Gly Ser Ile Ser Tyr Arg Asn Ala
85 90 95



cgc gcc ccc cgc ggg gcc ccc gcc gcc gcc gcc gcc gcc gcg ccg ccg 
Arg Ala Pro Arg Gly Ala Pro Ala Ala Ala Ala Ala Ala Ala Pro Pro 
115 120 125

ccc acg ccc gcc ccg ccg cca ccg ccc gcg ccc gtc gcc gcc gcc gcc 
Pro Thr Pro Ala Pro Pro Pro Pro Pro Ala Pro Val Ala Ala Ala Ala 
130 135 140 

ccg gcc cgg gcg ccc cgc gcg gcc gcc gcc gcc gcc aca gcg ccc ccc
Pro Ala Arg Ala Pro Arg Ala Ala Ala Ala Ala Ala Thr Ala Pro Pro
145 150 155 160

tcg cct ggc ccc gcg cag ccg ggc ccc cgc gcg cag cgg gcc gcg ccc
Ser Pro Gly Pro Ala Gln Pro Gly Pro Arg Ala Gln Arg Ala Ala Pro
165 170 175

ctg gcc gcg ccg ccg ccc gcg cca gcc gct ccc ccg gcg gtg gcg ccc
Leu Ala Ala Pro Pro Pro Ala Pro Ala Ala Pro Pro Ala Val Ala Pro
180 185 190

ccg gcc ggc ccg cgc cgc gcc ccc ccg ccc gcc gtc gcc gcc cgg gag
Pro Ala Gly Pro Arg Arg Ala Pro Pro Pro Ala Val Ala Ala Arg Glu
195 200 205

ccg ccg ctg ccg ccg ccg cca cag ccg ccg gcg ccg cca cag cag cag
Pro Pro Leu Pro Pro Pro Pro Gln Pro Pro Ala Pro Pro Gln Gln Gln
210 215 220

100 105 110
cag ccg ccg ccg ccg cag cca cag ccg ccg ccg gag ggg ggc gcg gtg 720
Gln Pro Pro Pro Pro Gln Pro Gln Pro Pro Pro Glu Gly Gly Ala Val
225 230 235 240

cgg gcc ggc ggc gcg gcg cgg ccc gtg agc ctg cgg gaa gtc gtg cgc 768
Arg Ala Gly Gly Ala Ala Arg Pro Val Ser Leu Arg Glu Val Val Arg
245 250 255

tac ctc ggg ggc agc ggc ggc gcc ggc ggt cgc cta acc cgc ggc cgc 816
Tyr Leu Gly Gly Ser Gly Gly Ala Gly Gly Arg Leu Thr Arg Gly Arg
260 265 270

gtg cag ggg ctg ctg gag gag gag gcg gcg gct cga ggc cgt ctg gag 864
Val Gln Gly Leu Leu Glu Glu Glu Ala Ala Ala Arg Gly Arg Leu Glu
275 280 285

cgc acc cgt ctc gga gcg ctt gcg ctg ccc cgc ggg gac agg ccc gga 912
Arg Thr Arg Leu Gly Ala Leu Ala Leu Pro Arg Gly Asp Arg Pro Gly
290 295 300

cgg gcg ccg ccg gcc gcc agc gcc cgc ccg tct cgc agc aag aga ggt 960
Arg Ala Pro Pro Ala Ala Ser Ala Arg Pro Ser Arg Ser Lys Arg Gly
310 315 320

gga gaa gag cga gta ctt gag aaa gaa gag gaa gaa gat gat gat gaa 1008
Gly Glu Glu Arg Val Leu Glu Lys Glu Glu Glu Glu Asp Asp Asp Glu
325 330 335

gat gaa gat gaa gaa gat gat gtg tca gag ggc tct gaa gtg ccc gag 1056
Asp Glu Asp Glu Glu Asp Asp Val Ser Glu Gly Ser Glu Val Pro Glu
340 345 350

agt gac cgt cct gca ggt gcc cag cac cac cag ctt aac ggc gag cgg 1104
Ser Asp Arg Pro Ala Gly Ala Gln His His Gln Leu Asn Gly Glu Arg
355 360 365

gga cct cag agt gcc aag gag agg gtc aag gag tgg acc ccc tgc gga 1152
Gly Pro Gln Ser Ala Lys Glu Arg Val Lys Glu Trp Thr Pro Cys Gly
370 375 380

ccg cac cag ggc cag gat gaa ggg cgg ggg cca gcc ccg ggc agc ggc 1200
Pro His Gln Gly Gln Asp Glu Gly Arg Gly Pro Ala Pro Gly Ser Gly
385 390 395 400

acc cgc cag gtg ttc tcc atg gca gcc atg aac aag gaa ggg gga aca 1248
Thr Arg Gln Val Phe Ser Met Ala Ala Met Asn Lys Glu Gly Gly Thr
405 410 415

gct tct gtt gcc acc ggg cca gac tcc ccg tcc ccc gtg cct ttg ccc 1296
Ala Ser Val Ala Thr Gly Pro Asp Ser Pro Ser Pro Val Pro Leu Pro
420 425 430

cca ggc aaa cca gcc cta cct ggg gcc gac ggg acc ccc ttt ggc tgt 1344
Pro Gly Lys Pro Ala Leu Pro Gly Ala Asp Gly Thr Pro Phe Gly Cys

FIG. 7A-2
ccg ccc ggg cgc aaa gag aag cca tct gat ccc gtc gag tgg acc gtg
Pro Pro Gly Arg Lys Glu Lys Pro Ser Asp Pro Val Glu Trp Thr Val
450 455 460

atg gat gtc gtc gaa tat ttt act gag gct gga ttc ccg gag cag gcg
Met Asp Val Val Glu Tyr Phe Thr Glu Ala Gly Phe Pro Glu Gln Ala
465 470 475 480

aca gct ttc caa gag cag gaa att gat ggc aaa tct ttg ctg ctc atg
Thr Ala Phe Gln Glu Gln Glu Ile Asp Gly Lys Ser Leu Leu Leu Met
485 490 495

cag cgc aca gat gtg ctc acc ggc ctg tcc atc cgc ctc ggg cca gcc
Gln Arg Thr Asp Val Leu Thr Gly Leu Ser Ile Arg Leu Gly Pro Ala
500 505 510

ctg aaa atc tac gag cac cac atc aag gtg ctt cag caa ggc cac ttt
Leu Lys Ile Tyr Glu His His Ile Lys Val Leu Gln Gln Gly His Phe
515 520 525 



gag gat gat gac ccc gat ggc ttc tta ggc 
Glu Asp Asp Asp Pro Asp Gly Phe Leu Gly 
530 535 



FIG. 7A-3 
glu glu arg val leu glu lys glu glu glu glu asp
asp asp glu asp glu asp glu glu asp asp val ser
glu gly ser glu val pro glu ser asp arg pro ala
gly ala gln his his gln leu asn gly glu arg gly
pro gln ser ala lys glu arg val lys glu trp thr
pro cys gly pro his gln gly gln asp glu gly arg
gly pro ala pro gly ser gly thr arg gln val phe
ser met ala ala met asn lys glu gly gly thr ala
ser val ala thr gly pro asp ser pro ser pro val
pro leu pro pro gly lys pro ala leu pro gly ala
asp gly thr pro phe gly cys pro pro gly arg lys
glu lys pro ser asp pro val glu trp thr val met
asp val val glu tyr phe thr glu ala gly phe pro
glu gln ala thr ala phe gln glu gln glu ile asp
gly lys ser leu leu leu met gln arg thr asp val
leu thr gly leu ser ile arg leu gly pro ala leu
lys ile tyr glu his his ile lys val leu gln gln
gly his phe glu asp asp asp pro asp gly phe leu
gly

FIG. 7B
atg aag aac caa gac aaa aag aac ggg gct gcc aaa caa tcc aat cca 48
Met Lys Asn Gln Asp Lys Lys Asn Gly Ala Ala Lys Gln Ser Asn Pro
1 5 10 15

aaa agc agc cca gga caa ccg gaa gca gga ccc gag gga gcc cag gag 96
Lys Ser Ser Pro Gly Gln Pro Glu Ala Gly Pro Glu Gly Ala Gln Glu
20 25 30

cgg ccc agc cag gcg gct cct gca gta gaa gca gaa ggt ccc ggc agc 144
Arg Pro Ser Gln Ala Ala Pro Ala Val Glu Ala Glu Gly Pro Gly Ser
35 40 45

agc cag gct cct cgg aag ccg gag ggt gct caa gcc aga acg gct cag 192
Ser Gln Ala Pro Arg Lys Pro Glu Gly Ala Gln Ala Arg Thr Ala Gln
50 55 60

tct ggg gcc ctt cgt gat gtc tct gag gag ctg agc cgc caa ctg gaa 240
Ser Gly Ala Leu Arg Asp Val Ser Glu Glu Leu Ser Arg Gln Leu Glu
65 70 75 80

gac ata ctg agc aca tac tgt gtg gac aat aac cag ggg ggc ccc ggc 288
Asp Ile Leu Ser Thr Tyr Cys Val Asp Asn Asn Gln Gly Gly Pro Gly
85 90 95

gag gat ggg gca cag ggt gag ccg gct gaa ccc gaa gat gca gag aag 336
Glu Asp Gly Ala Gln Gly Glu Pro Ala Glu Pro Glu Asp Ala Glu Lys
100 105 110

tcc cgg acc tat gtg gca agg aat ggg gag cct gaa cca act cca gta 384
Ser Arg Thr Tyr Val Ala Arg Asn Gly Glu Pro Glu Pro Thr Pro Val
115 120 125

gtc aat gga gag aag gaa ccc tcc aag ggg gat cca aac aca gaa gag 432
Val Asn Gly Glu Lys Glu Pro Ser Lys Gly Asp Pro Asn Thr Glu Glu
130 135 140

atc cgg cag agt gac gag gtc gga gac cga gac cat cga agg cca cag 480
Ile Arg Gln Ser Asp Glu Val Gly Asp Arg Asp His Arg Arg Pro Gln
145 150 155 160

gag aag aaa aaa gcc aag ggt ttg ggt aag gag atc acg ttg ctg atg 528
Glu Lys Lys Lys Ala Lys Gly Leu Gly Lys Glu Ile Thr Leu Leu Met
165 170 175

cag aca ttg aat act ctg agt acc cca gag gag aag ctg gct gct ctg 576
Gln Thr Leu Asn Thr Leu Ser Thr Pro Glu Glu Lys Leu Ala Ala Leu
180 185 190

tgc aag aag tat gct gaa ctg ctg gag gag cac cgg aat tca cag aag 624
Cys Lys Lys Tyr Ala Glu Leu Leu Glu Glu His Arg Asn Ser Gln Lys
195 200 205

FIG. 8A-1
cag atg aag ctc cta cag aaa aag cag agc cag ctg gtg caa gag aag 672
Gln Met Lys Leu Leu Gln Lys Lys Gln Ser Gln Leu Val Gln Glu Lys
210 215 220

gac cac ctg cgc ggt gag cac agc aag gcc gtc ctg gcc cgc agc aag 720
Asp His Leu Arg Gly Glu His Ser Lys Ala Val Leu Ala Arg Ser Lys
225 230 235 240

ctt gag agc cta tgc cgt gag ctg cag cgg cac aac cgc tcc ctc aag 768
Leu Glu Ser Leu Cys Arg Glu Leu Gln Arg His Asn Arg Ser Leu Lys
245 250 255

gaa gaa ggt gtg cag cgg gcc cgg gag gag gag gag aag cgc aag gag 816
Glu Glu Gly Val Gln Arg Ala Arg Glu Glu Glu Glu Lys Arg Lys Glu
260 265 270

gtg acc tcg cac ttc cag gtg aca ctg aat gac att cag ctg cag atg 864
Val Thr Ser His Phe Gln Val Thr Leu Asn Asp Ile Gln Leu Gln Met
275 280 285

gaa cag cac aat gag cgc aac tcc aag ctg cgc caa gag aac atg gag 912
Glu Gln His Asn Glu Arg Asn Ser Lys Leu Arg Gln Glu Asn Met Glu
290 295 300

ctg gct gag agg ctc aag aag ctg att gag cag tat gag ctg cgc gag 960
Leu Ala Glu Arg Leu Lys Lys Leu Ile Glu Gln Tyr Glu Leu Arg Glu
305 310 315 320

gag cat atc gac aaa gtc ttc aaa cac aag gac cta caa cag cag ctg 1008
Glu His Ile Asp Lys Val Phe Lys His Lys Asp Leu Gln Gln Gln Leu
325 330 335

gtg gat gcc aag ctc cag cag gcc cag gag atg cta aag gag gca gaa 1056
Val Asp Ala Lys Leu Gln Gln Ala Gln Glu Met Leu Lys Glu Ala Glu
340 345 350

gag cgg cac cag cgg gag aag gat ttt ctc ctg aaa gag gca gta gag 1104
Glu Arg His Gln Arg Glu Lys Asp Phe Leu Leu Lys Glu Ala Val Glu
355 360 365

tcc cag agg atg tgt gag ctg atg aag cag caa gag acc cac ctg aag 1152
Ser Gln Arg Met Cys Glu Leu Met Lys Gln Gln Glu Thr His Leu Lys
370 375 380

caa cag ctt gcc cta tac aca gag aag ttt gag gag ttc cag aac aca 1200
Gln Gln Leu Ala Leu Tyr Thr Glu Lys Phe Glu Glu Phe Gln Asn Thr
385 390 395 400

ctt tcc aaa agc agc gag gta ttc acc aca ttc aag cag gag atg gaa 1248
Leu Ser Lys Ser Ser Glu Val Phe Thr Thr Phe Lys Gln Glu Met Glu
405 410 415

FIG. 8A-2
agg gcc
lys ser ser pro gly gln pro glu ala gly pro glu gly ala
gln glu arg pro ser gln ala ala pro ala val glu ala glu gly
pro gly ser ser gln ala pro arg lys pro glu gly ala gln ala
arg thr ala gln ser gly ala leu arg asp val ser glu glu leu
ser arg gln leu glu asp ile leu ser thr tyr cys val asp asn
asn gln gly gly pro gly glu asp gly ala gln gly glu pro ala
glu pro glu asp ala glu lys ser arg thr tyr val ala arg asn
gly glu pro glu pro thr pro val val tyr gly glu lys glu pro
ser lys gly asp pro asn thr glu glu ile arg gln ser asp glu
val gly asp arg asp his arg arg pro gln glu lys lys lys ala
lys gly leu gly lys glu ile thr leu leu met gln thr leu asn
thr leu ser thr pro glu glu lys leu ala ala leu cys lys lys
tyr ala glu leu leu glu glu his arg asn ser gln lys gln met
lys leu leu gln lys lys gln ser gln leu val gln glu lys asp
his leu arg gly glu his ser lys ala val leu ala arg ser lys
leu glu ser leu cys arg glu leu gln arg his asn arg ser leu
lys glu glu gly val gln arg ala arg glu glu glu glu lys arg
lys glu val thr ser his phe gln val thr leu asn asp ile gln
leu gln met glu gln his asn glu arg asn ser lys leu arg gln
glu asn met glu leu ala glu arg leu lys lys leu ile glu gln
tyr glu leu arg glu glu his ile asp lys val phe lys his lys
asp leu gln gln gln leu val asp ala lys leu gln gln ala gln
glu met leu lys glu ala glu glu arg his gln arg glu lys asp
phe leu leu lys glu ala val glu ser gln arg met cys glu leu
met lys gln gln glu thr his leu lys gln gln leu ala leu tyr
thr glu lys phe glu glu phe gln asn thr leu ser lys ser ser

FIG. 8B-1

glu val phe thr thr phe lys gln glu met glu lys met thr lys
lys ile lys lys leu glu lys glu thr thr met tyr arg ser arg
trp glu ser ser asn lys ala leu leu glu met ala glu glu lys
thr val arg asp lys glu leu glu gly leu gln val lys ile gln
arg leu glu lys leu cys arg ala leu gln thr glu arg asn asp
leu asn lys arg val gln asp leu ser ala gly gly gln gly ser
leu thr asp ser gly pro glu arg arg pro glu gly pro gly ala
gln ala pro ser ser pro arg val thr glu ala pro cys tyr pro
gly ala pro ser thr glu ala ser gly gln thr gly pro gln glu
pro thr ser ala arg ala *** 

FIG. 8B-2 
val asp val asp

glu tyr asp glu asn lys phe val asp glu glu asp

gly gly asp gly

FIG. 9 
1 AAG CCT CGC AGC GGT CGG GGC GGC GCC GCG GAG GCT

3 ^ S 2 coo cog oca oco oco « too aag aac ago 

73 GTG TCG TCG GCG CGG TTC CGG AAG GTG GAC GTG GAT
val ser ser ala arg phe arg lys val asp val asp

109 GAG TAC GAC GAG AAC AAG TTC GTG GAC GAG GAA GAC
glu tyr asp glu asn lys phe val asp glu glu asp

145 GGC GGC GAC GGC CAG GCG GGG CCG GAC GAG GGC GAG
gly gly asp gly gln ala gly pro asp glu gly glu

181 GTG GAC TCG TGC CTG CGG CAA GGG AAC ATG ACA GCC
val asp ser cys leu arg gln gly asn met thr ala

217 GCC CTG CAG GCG GCG CTG AAG AAC CCT CCC ATC AAC
ala leu gln ala ala leu lys asn pro pro ile asn

253 ACC AGG AGC CAG GCG GTG AAG GAC CGG GCA GGC AGC
thr arg ser gln ala val lys asp arg ala gly ser

289 ATC GTG CTG AAG GTG CTC ATC TCC TTC AAG GCC GGC
ile val leu lys val leu ile ser phe lys ala gly

325 GAC ATA GAA AAG GCC GTG CAG TCC CTG GAC AGG AAC
asp ile glu lys ala val gln ser leu asp arg asn

361 GGC GTG GAC CTG CTC ATG AAG TAC ATC TAC AAG GGC
gly val asp leu leu met lys tyr ile tyr lys gly

" - 2 2 pro 2 2 2 2 2 2 2 2 

« S gfn 2 2 2 - 2 2 2 2 2 2 
« S 2 2 2 2 arg 2 2 2 2 2 2 

505 ACC GTG TAG CCT GGC AGG AAC GGG TGC CTG CCG GGG
thr val

FIG. 10A 
541 AGC GGG AGC TGC CGG TAC AAA GAC CAA AAC GCC CAG
577 ATG CCG CCG CTG CCC TGT GGG CGG CGT CTG TTC CCA
613 GCT TCG CTT TTT CCC TTT CCC GTG TCT GTC AGG ATT
649 ACA TAA GGT TTC CCT TCG TGA GAA TCG GAG TGG CGC
685 AGA GGG TCC TGT TCA TAC GCG CCG TGC GTC CGG CTG
721 TGT AAG ACC CCT GCC TTC AGT GTC CTT GAG CAA CGG
757 TAG CGT GTC GCC GGC TGG GTT TGG TTT TGT CGT GGA
793 GGG ATC TGG TCA GAA TTT GAG GCC AGT TTC CTA ACT
829 CAT TGC TGG TCA GGA AAT GAT CTT CAT TTA AAA AAA
865 AAA AAA AGA CTG GCA GCT ATT ATG CAA AAC TGG ACC
901 CTC TTC CCT TAT TTA AGC AGA GTG AGT TTC TGG AAC
937 CAG TGG TGC CCC CCC CCC CGC CCC GGC CGC CGT CCT
973 GCT CAA GGG AAG CCT CCC TGC AGA GCA GCA GAG CCC
1009 CTG GGC AGG AGC GCC GCG TCC CGC TCC CAG GAG ACA
1045 GCA TGC GCG GTC ACG CGG CAC TTC CTG TGC CTC CCA
1081 GCC CCA GTG CCC CGG AGT TCT TCA GGG CGA CAG GGA
1117 CCT CAG AAG ACT GGA TCC GAT CCA GAC AGA CGC CCA
1153 TTC TTG GTT CAG CTC AGT GTT TTC AAA AGG AAC GTG
1189 CTA CCG TGG GTA GAG CAC ACT GGT TCT CAG AAC ACG
1225 GCC GGC GCT TGA CGG TTG TCA CAG CTC CAG AAC AAA
1261 TCC TGG GAG ACA GGC GAG CGC GAG TCG CCG GGC AGG
1297 AAT TCC ACA CAC TCG TGC TGT TTT TGA TAC CTG CTT
1333 TTT GTT TTG TTT TGT AAA AAT GAT GCA CTT GAG AAA
1369 ATA AAA CGT CAG TGT TGA CAA AAA AAA AAA AAA AAA



FIG. 10B 
1 GAC TGC CGC AGC AGC AGC AAC AAC CGC TAG CCG AAG
asp cys arg ser ser ser asn asn arg Xaa pro lys

37 GGT GGC GCG GCG CGG GCC GGC GGC CCG GCG CGG CCC
gly gly ala ala arg ala gly gly pro ala arg pro

73 GTG AGC CTG CGG GAA GTC GTG CGC TAC CTC GGG GGT
val ser leu arg glu val val arg tyr leu gly gly

109 AGC AGC GGC GCT GGC GGC CGC CTG ACC CGC GGC CGC
ser ser gly ala gly gly arg leu thr arg gly arg

145 GTG CAG GGT CTG CTG GAA GAG GAG GCG GCG GCG CGG
val gln gly leu leu glu glu glu ala ala ala arg

181 GGC CGC CTG GAG CGC ACC CGT CTC GGA GCG CTT GCG
gly arg leu glu arg thr arg leu gly ala leu ala

217 CTG CCC CGC GGG GAC AGG CCC GGA CGG GCG CCA CCG
leu pro arg gly asp arg pro gly arg ala pro pro

253 GCC GCC AGC GCC CGC GCG GCG CGG AAC AAG AGA GCT
ala ala ser ala arg ala ala arg asn lys arg ala

289 GGC GAG GAG CGA GTG CTT GAA AAG GAG GAG GAG GAG
gly glu glu arg val leu glu lys glu glu glu glu

325 GAG GAG GAG GAA GAC GAC GAG GAC GAC GAC GAC GAC
glu glu glu glu asp asp glu asp asp asp asp asp

361 GTC GTG TCC GAG GGC TCG GAG GTG CCC GAG AGC GAT
val val ser glu gly ser glu val pro glu ser asp

397 CGT CCC GCG GGT GCG CAG CAT CAC CAG CTG AAT GGC
arg pro ala gly ala gln his his gln leu asn gly

433 GGC GAG CGC GGC CCG CAG ACC GCC AAG GAG CGG GCC
gly glu arg gly pro gln thr ala lys glu arg ala

469 AAG GAG TGG TCG CTG TGT GGC CCC CAC CCT GGC CAG
lys glu trp ser leu cys gly pro his pro gly gln

FIG. 11A
505 GAG GAA GGG CGG GGG CCG GCC GCG GGC AGT GGC ACC
glu glu gly arg gly pro ala ala gly ser gly thr 

541 CGC CAG GTG TTC TCC ATG GCG GCC TTG AGT AAG GAG 
arg gln val phe ser met ala ala leu ser lys glu

577 GGG GGA TCA GCC TCT TCG ACC ACC GGG CCT GAC TCC 
gly gly ser ala ser ser thr thr gly pro asp ser 

613 CCG TCC CCG GTG CCT TTG CCC CCC GGG AAG CCA GCC 
pro ser pro val pro leu pro pro gly lys pro ala 

649 CTC CCA GGA GCC GAT GGG ACC CCC TTT GGC TGC CCT 
leu pro gly ala asp gly thr pro phe gly cys pro 

685 GCC GGG CGC AAA GAG AAG CCG GCA GAC CCC GTG GAG 
ala gly arg lys glu lys pro ala asp pro val glu 

721 TGG ACA GTC ATG GAC GTC GTG GAG TAC TTC ACC GAG
trp thr val met asp val val glu tyr phe thr glu 

757 GCG GGC TTC CCT GAG CAA GCC ACG GCT TTC CAG GAG 
ala gly phe pro glu gln ala thr ala phe gln glu

793 CAG GAG ATC GAC GGC AAG TCC CTG CTG CTC ATG CAG 
gln glu ile asp gly lys ser leu leu leu met gln

829 CGC ACC GAT GTC CTC ACC GGC CTG TCC ATC CGC CTG 
arg thr asp val leu thr gly leu ser ile arg leu 

865 GGG CCA GCG TTG AAA ATC TAT GAG CAC CAT ATC AAG 
gly pro ala leu lys ile tyr glu his his ile lys

901 GTG CTG CAG CAG GGT CAC TTC GAG GAC GAT GAC CCG 
val leu gln gln gly his phe glu asp asp asp pro

937 GAA GGC TTC CTG GGA TGA GCA CAG AGC CGC CGC GCC 
glu gly phe leu gly 

973 CCT TGT CCC CAC CCC CAC CCC GCC TGG ACC CAT TCC 

1009 TGC CTC CAT GTC ACC CAA GGT GTC CCA GAG GCC AGG 

1045 AGC TGG ACT GGG CAG GCG AGG GGT GCG GAC CTA CCC

1081 TGA TTC TGG TAG GGG GCG GGG CCT TGC TGT GCT CAT 

FIG. 11B 
AAG


AAA 


AAA 
AAA


1297 
GGT
1333


GGT 


GTT 


TAT 


TCC 




CCG TGT GTG TCT CTG CAC CTG
TCC CGG AGC CTG GAT GTC GCC
CTC ATT TTG CCC CCA GAT CAG
CTG TCC CAG GAC ATT TTT TAA
AAA AAA AAT TGG GGA GGG GGC
AGA TCC TCC TCG GCC CAA CCA
ATA TAT ATA TAT ATG TTT TGT
1369 TCT GCC TGT TTT TCG TTT TTT GGT GCG TGG CCT TTC
1405 TTC CCT CCC ACC ACC ACT CAT GGC CCC AGC CCT GCT
1441 CGC CCT GTC GGC GGG AGC AGC TGG GAA TGG GAG GAG
1477 GGT GGG ACC TTG GGT CTG TCT CCC ACC CTC TCT CCC
1513 GTT GGT TCT GTT GTC GCT CCA GCT GGC TGT ATT GCT
1549 TTT TAA TAT TGC ACC GAA GGG TTG TTT TTT TTT TTT
1585 TAA ATA AAA TTT TAA AAA AAG GAA AAA AAA AAA



FIG. 11C 
256 GCC AGC GCC CGC GCG GCG CGG AAC AAG AGA GCT
ala ser ala arg ala ala arg asn lys arg ala

289 GGC GAG GAG CGA GTG CTT GAA AAG GAG GAG GAG GAG 
gly glu glu arg val leu glu lys glu glu glu glu 

325 GAG GAG GAG GAA GAC GAC GAG GAC GAC GAC GAC GAC 
glu glu glu glu asp asp glu asp asp asp asp asp 

361 GTC GTG TCC GAG GGC TCG GAG GTG CCC GAG AGC GAT
val val ser glu gly ser glu val pro glu ser asp 

397 CGT CCC GCG GGT GCG CAG CAT CAC CAG CTG AAT GGC
arg pro ala gly ala gln his his gln leu asn gly

433 GGC GAG CGC GGC CCG CAG ACC GCC AAG GAG CGG GCC
gly glu arg gly pro gln thr ala lys glu arg ala

469 AAG GAG TGG TCG CTG TGT GGC CCC CAC CCT GGC CAG
lys glu trp ser leu cys gly pro his pro gly gln

505 GAG GAA GGG CGG GGG CCG GCC GCG GGC AGT GGC ACC 
glu glu gly arg gly pro ala ala gly ser gly thr 

541 CGC CAG GTG TTC TCC ATG GCG GCC TTG AGT AAG GAG 
arg gln val phe ser met ala ala leu ser lys glu

577 GGG GGA TCA GCC TCT TCG ACC ACC GGG CCT GAC TCC 
gly gly ser ala ser ser thr thr gly pro asp ser 

613 CCG TCC CCG GTG CCT TTG CCC CCC GGG AAG CCA GCC 
pro ser pro val pro leu pro pro gly lys pro ala 

649 CTC CCA GGA GCC GAT GGG ACC CCC TTT GGC TGC CCT
leu pro gly ala asp gly thr pro phe gly cys pro 

685 GCC GGG CGC AAA GAG AAG CCG GCA GAC CCC GTG GAG
ala gly arg lys glu lys pro ala asp pro val glu 

721 TGG ACA GTC ATG GAC GTC GTG GAG TAC TTC ACC GAG 
trp thr val met asp val val glu tyr phe thr glu 



FIG. 12A 
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757 GCG GGC TTC CCT GAG CAA GCC ACG GCT TTC CAG GAG
ala gly phe pro glu gln ala thr ala phe gln glu

793 CAG GAG ATC GAC GGC AAG TCC CTG CTG CTC ATG CAG
gln glu ile asp gly lys ser leu leu leu met gln

829 CGC ACC GAT GTC CTC ACC GGC CTG TCC ATC CGC CTG
arg thr asp val leu thr gly leu ser ile arg leu

865 GGG CCA GCG TTG AAA ATC TAT GAG CAC CAT ATC AAG
gly pro ala leu lys ile tyr glu his his ile lys

901 GTG CTG CAG CAG GGT CAC TTC GAG GAC GAT GAC CCG
val leu gln gln gly his phe glu asp asp asp pro

937 GAA GGC TTC CTG GGA TGA GCA CAG AGC CGC CGC GCC
glu gly phe leu gly
973 CCT TGT CCC CAC CCC CAC CCC GCC TGG ACC CAT TCC
Zl S TGG ACT S £ E S S GCG GAC CTA CCC 

- s s i i i i i s s s s gcc 

1153 GCC CCA GCA CAC CCC TCC CGG ^ 

1189 CCG CCT CCC TCC CTC CTG t£ CAG GAC ATT TTT TAA 
IIH AAG AAA AAA AGG AAA AAA AAA AAT TGG GGA GGG GGC 

S GTT Z TCC AT A A TAT ATA TAT ATG TTT TGT 
SS TCT fee TGT TTT TCG TTT TTT GGT GCG TGG CCT TTC 
14 05 TTC CCT CCC ACC ACC ACT CAT GGC 

x441 CGC CCT GTC GGC GGG AGC AGC TGG GAA ^ 
1477 GGT GGG ACC TTG GGT CTG TCT CCC ^ 
15 13 GTT GGT TCT GTT GTC GCT CCA GCT 

TTT TAA TAT TGC ACC GAA GGG 
1549 TAA ATA AAA TTT TAA AAA AAG GAA AAA AAA AAA 



1585 



FIG. 12B 
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196 ACC CGT CTC GGA GCG CTT GCG
thr arg leu gly ala leu ala

217 CTG CCC CGC GGG GAC AGG CCC GGA CGG GCG CCA CCG
leu pro arg gly asp arg pro gly arg ala pro pro 

253 GCC GCC AGC GCC CGC GCG GCG CGG AAC AAG AGA GCT 
ala ala ser ala arg ala ala arg asn lys arg ala 

289 GGC GAG GAG CGA GTG CTT GAA AAG GAG GAG GAG GAG
gly glu glu arg val leu glu lys glu glu glu glu 

325 GAG GAG GAG GAA GAC GAC GAG GAC GAC GAC GAC GAC 
glu glu glu glu asp asp glu asp asp asp asp asp 

361 GTC GTG TCC GAG GGC TCG GAG GTG CCC GAG AGC GAT 
val val ser glu gly ser glu val pro glu ser asp 

397 CGT CCC GCG GGT GCG CAG CAT CAC CAG CTG AAT GGC 
arg pro ala gly ala gln his his gln leu asn gly

433 GGC GAG CGC GGC CCG CAG ACC GCC AAG GAG CGG GCC
gly glu arg gly pro gln thr ala lys glu arg ala

469 AAG GAG TGG TCG CTG TGT GGC CCC CAC CCT GGC CAG
lys glu trp ser leu cys gly pro his pro gly gln

505 GAG GAA GGG CGG GGG CCG GCC GCG GGC AGT GGC ACC
glu glu gly arg gly pro ala ala gly ser gly thr

541 CGC CAG GTG TTC TCC ATG GCG GCC TTG AGT AAG GAG
arg gln val phe ser met ala ala leu ser lys glu

577 GGG GGA TCA GCC TCT TCG ACC ACC GGG CCT GAC TCC
gly gly ser ala ser ser thr thr gly pro asp ser

613 CCG TCC CCG GTG CCT TTG CCC CCC GGG AAG CCA GCC
pro ser pro val pro leu pro pro gly lys pro ala 

649 CTC CCA GGA GCC GAT GGG ACC CCC TTT GGC TGC CCT 
leu pro gly ala asp gly thr pro phe gly cys pro 

FIG. 13A 
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685 GCC GGG CGC AAA GAG AAG CCG GCA GAC CCC GTG GAG 
ala gly arg lys glu lys pro ala asp pro val glu 

721 TGG ACA GTC ATG GAC GTC GTG GAG TAC TTC ACC GAG
trp thr val met asp val val glu tyr phe thr glu

757 GCG GGC TTC CCT GAG CAA GCC ACG GCT TTC CAG GAG
ala gly phe pro glu gln ala thr ala phe gln glu

793 CAG GAG ATC GAC GGC AAG TCC CTG CTG CTC ATG CAG
gln glu ile asp gly lys ser leu leu leu met gln

829 CGC ACC GAT GTC CTC ACC GGC CTG TCC ATC CGC CTG
arg thr asp val leu thr gly leu ser ile arg leu

865 GGG CCA GCG TTG AAA ATC TAT GAG CAC CAT ATC AAG
gly pro ala leu lys ile tyr glu his his ile lys

901 GTG CTG CAG CAG GGT CAC TTC GAG GAC GAT GAC CCG
val leu gln gln gly his phe glu asp asp asp pro

937 GAA GGC TTC CTG GGA TGA GCA CAG AGC CGC CGC GCC
glu gly phe leu gly

973 CCT TGT CCC CAC CCC CAC CCC GCC TGG ACC CAT TCC
1009 TGC CTC CAT GTC ACC CAA GGT GTC CCA GAG GCC AGG
1045 AGC TGG ACT GGG CAG GCG AGG GGT GCG GAC CTA CCC
1081 TGA TTC TGG TAG GGG GCG GGG CCT TGC TGT GCT CAT
1117 TGC TAG CCC CCC ACC CCG TGT GTG TCT CTG CAC CTG
1153 CCC CCA GCA CAC CCC TCC CGG AGC CTG GAT GTC GCC
1189 TGG GAC TCT GGC CTG CTC ATT TTG CCC CCA GAT CAG
1225 CCC CCT CCC TCC CTC CTG TCC CAG GAC ATT TTT TAA
1261 AAG AAA AAA AGG AAA AAA AAA AAT TGG GGA GGG GGC
1297 TGG GAA GGT GCC CCA AGA TCC TCC TCG GCC CAA CCA
1333 GGT GTT TAT TCC TAT ATA TAT ATA TAT ATG TTT TGT
1369 TCT TCC TGT TTT TGG TTT TTT GGT GCG TGG CCT TTC
1405 TTC CCT CCC ACC ACC ACT CAT GGC CCC AGC CCT GCT
1441 CGC CCT GTC GGC GGG AGC AGC TGG GAA TGG GAG GAG
1477 GGT GGG ACC TTG GGT CTG TCT CCC ACC CTC TCT CCC
1513 GTT GGT TCT GTT GTC GCT CCA GCT GGC TGT ATT GCT
1549 TTT TAA TAT TGC ACC GAA GGG TTG TTT TTT TTT TTT
1585 TAA ATA AAA TTT TAA AAA AAG GAA AAA AAA AAA



FIG. 13B 
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1 GTG GAA AAT AGC AAC TGT GTT TCT CAA GGA TCC AAT 
37 CCC AAC CTA AGG TGG CAG CGC ACA ATG AAG AAT CAA 

met lys asn gln

73 GAC AAA AAG AAC GGG GCT GCC AAA CAG CCC AAC CCC
asp lys lys asn gly ala ala lys gln pro asn pro

109 AAA AGC AGC CCG GGA CAG CCG GAA GCA GGA GCG GAG
lys ser ser pro gly gln pro glu ala gly ala glu

145 GGA GCC CAG GGG CGG CCC GGC CGG CCG GCC CCC GCC
gly ala gln gly arg pro gly arg pro ala pro ala

181 CGA GAA GCC GAA GGT GCC AGC AGC CAG GCT CCC GGG
arg glu ala glu gly ala ser ser gln ala pro gly

217 AGG CCG GAG GGG GCT CAA GCC AAA ACT GCT CAG CCT
arg pro glu gly ala gln ala lys thr ala gln pro

253 GGG GCG CTC TGT GAT GTC TCT GAG GAG CTG AGC CGC
gly ala leu cys asp val ser glu glu leu ser arg

289 CAG TTG GAA GAC ATA CTC AGT ACA TAC TGT GTG GAC
gln leu glu asp ile leu ser thr tyr cys val asp

325 AAC AAC CAG GGG GCC CCG GGT GAG GAT GGG GTC CAG
asn asn gln gly ala pro gly glu asp gly val gln

361 GGT GAG CCC CCT GAA CCT GAA GAT GCA GAG AAG TCT
gly glu pro pro glu pro glu asp ala glu lys ser

397 CGC GCC TAT GTG GCA AGG AAT GGG GAG CCG GAG CCG
arg ala tyr val ala arg asn gly glu pro glu pro

433 GGC ACC CCA GTA GTC AAT GGC GAG AAG GAG ACC TCC
gly thr pro val val asn gly glu lys glu thr ser

469 AAG GCA GAG CCG GGC ACG GAA GAG ATC CGG ACG AGC
lys ala glu pro gly thr glu glu ile arg thr ser

505 GAT GAG GTC GGA GAC CGA GAC CAC CGG AGG CCA CAG
asp glu val gly asp arg asp his arg arg pro gln

FIG. 14A 
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541 GAA AAG AAG AAG GCC AAG GGT CTG GGA AAG GAG ATC
glu lys lys lys ala lys gly leu gly lys glu ile

577 ACG CTG CTG ATG CAG ACA CTG AAC ACG CTG AGC ACC
thr leu leu met gln thr leu asn thr leu ser thr

613 CCA GAG GAG AAG CTG GCG GCT CTG TGC AAG AAG TAT
pro glu glu lys leu ala ala leu cys lys lys tyr

649 GCG GAA CTG CTC GAG GAG CAC CGG AAC TCG CAG AAG
ala glu leu leu glu glu his arg asn ser gln lys

685 CAG ATG AAG CTG CTG CAG AAG AAG CAG AGC CAG CTG
gln met lys leu leu gln lys lys gln ser gln leu

721 GTG CAG GAG AAG GAC CAC CTG CGT GGC GAG CAC AGC
val gln glu lys asp his leu arg gly glu his ser

757 AAG GCC ATC CTG GCC CGC AGC AAG CTC GAG AGC CTG
lys ala ile leu ala arg ser lys leu glu ser leu

793 TGC CGG GAG CTG CAG CGG CAC AAC CGC TCG CTC AAG
cys arg glu leu gln arg his asn arg ser leu lys

829 GAA GAA GGT GTG CAG CGA GCC CGA GAG GAG GAG GAG
glu glu gly val gln arg ala arg glu glu glu glu

865 AAG CGC AAG GAG GTG ACG TCA CAC TTC CAG ATG ACG
lys arg lys glu val thr ser his phe gln met thr

901 CTC AAC GAC ATT CAG CTG CAG ATG GAG CAG CAC AAC
leu asn asp ile gln leu gln met glu gln his asn

937 GAG CGC AAC TCC AAG CTG CGC CAG GAG AAC ATG GAG
glu arg asn ser lys leu arg gln glu asn met glu

973 CTG GCC GAG CGG CTC AAG AAG CTG ATT GAG CAG TAC
leu ala glu arg leu lys lys leu ile glu gln tyr

1009 GAG CTG CGA GAA GAG CAC ATC GAC AAA GTC TTC AAA
glu leu arg glu glu his ile asp lys val phe lys

FIG. 14B
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1045 CAC AAG GAT CTG CAG CAG CAG CTG GTG GAC GCC AAG 
his lys asp leu gln gln gln leu val asp ala lys

1081 CTC CAG CAG GCC CAG GAG ATG CTG AAG GAG GCA GAG
leu gln gln ala gln glu met leu lys glu ala glu

1117 GAG CGG CAC CAG CGG GAG AAG GAC TTT CTC CTG AAG
glu arg his gln arg glu lys asp phe leu leu lys

1153 GAG GCC GTG GAG TCC CAG AGG ATG TGC GAG CTG ATG
glu ala val glu ser gln arg met cys glu leu met

1189 AAG CAA CAG GAG ACC CAC CTG AAG CAG CAG CTT GCC
lys gln gln glu thr his leu lys gln gln leu ala

1225 CTA TAC ACA GAG AAG TTT GAG GAG TTC CAG AAC ACT
leu tyr thr glu lys phe glu glu phe gln asn thr

1261 CTT TCC AAA AGC AGC GAG GTG TTC ACC ACA TTC AAA
leu ser lys ser ser glu val phe thr thr phe lys

1297 CAG GAA ATG GAA AAG ATG ACA AAG AAG ATC AAG AAG
gln glu met glu lys met thr lys lys ile lys lys

1333 CTG GAG AAA GAG ACC ACC ATG TAC CGT TCC CGG TGG
leu glu lys glu thr thr met tyr arg ser arg trp

1369 GAG AGC AGC AAC AAG GCC CTG CTT GAG ATG GCT GAG
glu ser ser asn lys ala leu leu glu met ala glu

1405 GAG AAA ACA CTC CGG GAC AAA GAG CTG GAA GGC CTG
glu lys thr leu arg asp lys glu leu glu gly leu

1441 CAG GTG AAA ATC CAG CGG CTG GAG AAG CTG TGC CGG
gln val lys ile gln arg leu glu lys leu cys arg

1477 GCA CTG CAG ACA GAG CGC AAT GAC CTG AAC AAG AGG
ala leu gln thr glu arg asn asp leu asn lys arg

1513 GTG CAG GAC CTG AGT GCC GGT GGC CAG GGC CCC GTC
val gln asp leu ser ala gly gly gln gly pro val

FIG. 14C 
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1549 TCC GAC AGC GGT CCT GAG CGG AGG CCA GAG CCC GCC
ser asp ser gly pro glu arg arg pro glu pro ala

1585 ACC ACC TCC AAG GAG CAG GGT GTC GAG GGC CCC GGG
thr thr ser lys glu gln gly val glu gly pro gly

1621 GCT CAA GTA CCC AAC TCT CCA AGG GCC ACA GAC GCT
ala gln val pro asn ser pro arg ala thr asp ala

1657 TCC TGC TGC GCA GGT GCA CCC AGC ACA GAG GCA TCA
ser cys cys ala gly ala pro ser thr glu ala ser

1693 GGC CAG ACA GGG CCC CAG GAG CCC ACC ACT GCC ACT
gly gln thr gly pro gln glu pro thr thr ala thr

1729 GCC TAG AGA GCT TGG TGC TGG GGT GTG CCA GGA AGG
ala

1765 GAG CAG GCA GCC CAG CCA GGC CTG GCC CAG CCC AGG
1801 CTC CCA TGC TAA GCA GTC CGG TGC TGA GGC CAG GAT
1837 OT CTG ACC TGG CTG GCA CCT GAC CCT CTG CAG TCT
1873 TGG ATT TTG TGG GTC AGT TTT ACA TGC ATA TGG CAC
1909 IS TGC AAG GCC TCA CAC ATT TGT GTC TCT AAG TGT
1945 ACT GTG GGC TTG CAT CGG GGG TGA CGA TGG ACA GAT
1981 GAA GCC AGC GGC TCC CTT GTG AGC TGA AGT CTT ACG
2017 GA^ GAG ACG GCG TCT GCA CTG CCA TCG CAG TGA CCT
2053 GCT GGA CGA GTT CCT TGA GCT TTC CCT GCC TGC TTT
2089 GAG GCT GAG ACC CCT CCC GGC CCT TCA GAG CTC CTG
2125 ACA GGT GAT ACA CAC CCA GCC TTG ACC GCA CTT CTC
2161 TTG GGT AGC TGG GCT CTC CTA GCC TCC CCC AGA GGC
2197 GCC ATT GCT TCT CTT GAC TTG GAG AGG GGA TGC CCA
2233 GGC GTG GCC TTG GCA GGC ACT GGG AGC TAG TGA TTG
2269 GGC TGC TCT CCT GCC TCG AGC AGG GGC AGG AGT GTT
2305 ?CT GGT GGG ATG ATG CGC TCG CTG GTC AGG AGC CCC
2341 GTG GGC GCT GCT TCC CCC GCC CTC TGG TGA TGC CAG
2377 GAC CAG GCC AGT GAT GCT TCT CAG TAG CCT TAC CAT
2413 T-A CAG GTG CCT CTC CAG CCC GCA CAG TGA GTG ACA
2449 AGA TCA TCC AAA GGA TTC CTT CTG AAG GTG TTC GTT
2485 TCG TTT TGT TTT GTT GCA CGT GAC GGT TTG TAT TGA
2521 GGA CCC TCT GAG GAA GAG GGG TGC TGT AGC AGT GGT
2557 CCC TGC GTG CCT GGC TCC AGT GTC CTG CCC TCC CCC
2593 CCC TCG CCA TGG CTC CTC GGC CGC CTT GGT GCT GAG
2629 GTT TCT GTT TGG TGA GAT CAG GTT GTC TGT TCA GAG



FIG. 14D 



Matter No.: 10797-004006 Page 34 of 67

Applicant(s): Ann M. Lees et al. 

NOVEL LOW DENSITY LIPOPROTEIN BINDING PROTEINS 
AND THEIR USE IN DIAGNOSING AND TREATING 
ATHEROSCLEROSIS 



2665 AGA AGA
2701 TGC GGG
2737 TCA CTG
2773 AGG ACC
2809 TGC TTC
2845 TGG GGT
2881 GCC CAA
2917 CCC ATG
2953 CTG AGA
2989 CAC GGT
3025 ACC GCT
3061 GCA TTT
3097 GGC CTC
3133 CAG GGA
3169 GGG TTT
3205 TGT TGT
3241 GCT CAG
3277 TTC ATG
3313 GCT GAG
3349 CTG CTG
3385 GAA ATG
3421 GTG CCA
3457 TTG CCC
3493 GGG AAT
3529 TTC CCC
3565 CAA GAA
3601 GCC AGG
3637 CCC TTG
3673 CTG GCT
3709 GGG GTG
3745 GAA GCT
3781 CCA GCT
3817 CTT TGC
3853 CAG TCT
3889 GAC CAA
3925 CTC TGC
3961 CGC CCA
3997 GGA GGT
4033 CCT CAC
4069 TTA GGA
4105 AGC TTT
4141 GGA GCC
4177 GTG TCC



FIG. 14E 






4213 GAC TCA GCT [illegible] [illegible] GTG AGT CCA GCG GAG TGC GTC
4249 ACG TGA CCT [illegible] [illegible] [illegible] GGT GTC AGC CGC ACT TTG
4285 ATT TGT TTG TTT [illegible] ATG AGG TTT TTG GAC CAT GGG
4321 CTT AGC TCA [illegible] [illegible] [illegible] TCT GTA AGG AGA ATG TTA
4357 ACT TTC TGT AAA [illegible] GGT TAT TTA ACT AAC GCC TGC
4393 TTC CCC CAC TCC [illegible] [illegible] [illegible] GGC CAC CGA GAG CTC
4429 [illegible] [illegible] [illegible] CCA ATA GAG CTG CTC CAG CTC TCC CAT
4465 [illegible] [illegible] [illegible] CAC AAA GGT GGC CGC CCC AGG GAC AGC
4501 CAG GCA CCT GCC TGG GGG AGG GGC TTC TCT TCC TTA
4537 TGG CCT GGC CAT CTA GAT TGT TTA AAG TTG TGC TGA
4573 CAG CTT TTT TTG GTT TTT TGG TTT TTG TTT TTG TTT
4609 TTG TTT TTG TTT TTG TCT ACT TTT GGT ATT CAC AAC
4645 AGC CAG GGA CTT GAT TTT GAT GTA TTT TAA GCC ACA
4681 TTA AAT AAA GAG TCT GTT GCC TTA AAA AAA AAA AAA
4717 AAA AAA

FIG. 14F 






1 GAC GCC TCA GAG CGG AAC AGG GAA GTG AAT CAG GCG
37 CCG GGT AGT GGG TTG CTG GGC TGG GCT TGC TGA GGT
73 AGA GGC AGC GCC AAG AAG AGG CCT TTG CCG CTG GTC
109 GGG ATT GGG ATG TCG AAG AAC ACA GTG TCG TCG GCC
met ser lys asn thr val ser ser ala

145 CGC TTC CGG AAG GTG GAC GTG GAT GAA TAT GAC GAG
arg phe arg lys val asp val asp glu tyr asp glu

181 AAC AAG TTC GTG GAC GAA GAA GAT GGG GGC GAC GGC
asn lys phe val asp glu glu asp gly gly asp gly

217 CAG GCC GGG CCC GAC GAG GGC GAG GTG GAC TCC TGC
gln ala gly pro asp glu gly glu val asp ser cys

253 CTG CGG CAA GGA AAC ATG ACA GCT GCC CTA CAG GCA
leu arg gln gly asn met thr ala ala leu gln ala

289 GCT CTG AAG AAC CCC CCT ATC AAC ACC AAG AGT CAG
ala leu lys asn pro pro ile asn thr lys ser gln

325 GCA GTG AAG GAC CGG GCA GGC AGC ATT GTC TTG AAG
ala val lys asp arg ala gly ser ile val leu lys

361 GTG CTC ATC TCT TTT AAA GCT AAT GAT ATA GAA AAG
val leu ile ser phe lys ala asn asp ile glu lys

397 GCA GTT CAA TCT CTG GAC AAG AAT GGT GTG GAT CTC
ala val gln ser leu asp lys asn gly val asp leu

433 CTA ATG AAG TAT ATT TAT AAA GGA TTT GAG AGC CCG
leu met lys tyr ile tyr lys gly phe glu ser pro

469 TCT GAC AAT AGC AGT GCT ATG TTA CTG CAA TGG CAT
ser asp asn ser ser ala met leu leu gln trp his

505 GAA AAG GCA CTT GCT GCT GGA GGA GTA GGG TCC ATT
glu lys ala leu ala ala gly gly val gly ser ile

541 GTT CGT GTC TTG ACT GCA AGA AAA ACT GTG TAG TCT 
val arg val leu thr ala arg lys thr val 
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577 GGC AGG AAG TGG ATT ATC TGC CTC GGG AGT GGG AAT 
613 TGC TGG TAC AAA GAC CAA AAC AAC CAA ATG CCA CCG 
649 CTG CCC TGT GGG TAG CAT CTG TTT CTC TCA GCT TTG
685 CCT TCT TGC TTT TTC ATA TCT GTA AAG AAA AAA ATT
721 ACA TAT CAG TTG TCC CTT TAA TGA AAA TTG GGA TAA
757 TAT AGA AGA AAT TGT GTT AAA ATA GAA GTG TTT CAT
793 C r T TTC AAA ACC ATT TCA GTG ATG TTT ATA CCA ATC
829 TGT ATA TAG TAT AAT TTA CAT TCA AGT TTT AAT TGT
865 GCA ACT TTT AAC CCT GTT GGC TGG TTT TTG GTT CTG
901 TTT GGT TTT GTA TTA TTT TTA ACT AAT ACT GAA AAA
937 TTT GGT CAG AAT TTG AGG CCA GTT TCC TAG CTC ATT
973 GCT AGT CAG GAA ATG ATA TTT ATA AAA AAT ATG AGA 
1009 GAC TGG CAG CTA TTA ACA TTG CAA AAC TGG ACC ATA 
1045 TTT CCC TTA TTT AAT AAG CAA AAT ATG TTT TTG GAA 
1081 TAA GTG GTG GGT GAA TAC CAC TGC TAA GTT ATA GCT 
1117 TTG TTT TTG CTT GCC TCC TCA TTA TCT GTA CTG TGG 
1153 GTT TAA GTA TGC TAC TTT CTC TCA GCA TCC AAT AAT 
1189 CAT GGC CCC TCA ATT TAT TTG TGG TCA CGC AGG GTT 
1225 CAG AGC AAG AAG TCT TGC TTT ATA CAA ATG TAT CCA 
1261 TAA AAT ATC AGA GCT TGT TGG GCA TGA ACA TCA AAC 
1297 TTT TGT TCC ACT AAT ATG GCT CTG TTT GGA AAA AAC 
1333 TGC AAA TCA GAA AGA ATG ATT TGC AGA AAG AAA GAA 
1369 AAA CTA TGG TGT AAT TTA AAC TCT GGG CAG CCT CTG 
1405 AAT GAA ATG CTA CTT TCT TTA GAA ATA TAA TAG CTG 
1441 CCT TAG ACA TTA TGA GGT ATA CAA CTA GTA TTT AAG
1477 ATA CCA TTT AAT ATG CCC GGT AAA TGT CTT CAG TGT
1513 TCT TCA GGG TAG TTG GGA TCT CAA AAG ATT TGG TTC
1549 AGA TCC AAA CAA ATA CAC ATT CTG TGT TTT AGC TCA
1585 GTG TTT TCT AAA AAA AGA AAC TGC CAC ACA GCA AAA 
1621 AAT TGT TTA CTT TGT TGG ACA AAC CAA ATC AGT TCT 
1657 CAA AAA ATG ACC GGT GCT TAT AAA AAG TTA TAA ATA 
1693 TCG AGT AGC TCT AAA ACA AAC CAC CTG ACC AAG AGG 
1729 GAA GTG AGC TTG TGC TTA GTA TTT ACA TTG GAT GCC 
1765 AGT TTT GTA ATC ACT GAC TTA TGT GCA AAC TGG TGC 
1801 AGA AAT TCT ATA AAC TCT TTG CTG TTT TTG ATA CCT 
1837 GCT TTT TGT TTC ATT TTG TTT TGT TTT GTA AAA ATG 
1873 ATA AAA CTT CAG AAA ATA AAA TGT CAG TGT TGA ATA 
1909 ATT AAA AAA AAA AAA AA 
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1 GAA GAG CGA GTA CTT GAG AAA GAA GAG GAA GAA GAT
glu glu arg val leu glu lys glu glu glu glu asp

37 GAT GAT GAA GAT GAA GAT GAA GAA GAT GAT GTG TCA
asp asp glu asp glu asp glu glu asp asp val ser

73 GAG GGC TCT GAA GTG CCC GAG AGT GAC CGT CCT GCA
glu gly ser glu val pro glu ser asp arg pro ala

109 GGT GCC CAG CAC CAC CAG CTT AAC GGC GAG CGG GGA
gly ala gln his his gln leu asn gly glu arg gly

145 CCT CAG AGT GCC AAG GAG AGG GTC AAG GAG TGG ACC
pro gln ser ala lys glu arg val lys glu trp thr

181 CCC TGC GGA CCG CAC CAG GGC CAG GAT GAA GGG CGG
pro cys gly pro his gln gly gln asp glu gly arg

217 GGG CCA GCC CCG GGC AGC GGC ACC CGC CAG GTG TTC
gly pro ala pro gly ser gly thr arg gln val phe

253 TCC ATG GCA GCC ATG AAC AAG GAA GGG GGA ACA GCT
ser met ala ala met asn lys glu gly gly thr ala

289 TCT GTT GCC ACC GGG CCA GAC TCC CCG TCC CCC GTG
ser val ala thr gly pro asp ser pro ser pro val

325 CCT TTG CCC CCA GGC AAA CCA GCC CTA CCT GGG GCC
pro leu pro pro gly lys pro ala leu pro gly ala

361 GAC GGG ACC CCC TTT GGC TGT CCT CCC GGG CGC AAA
asp gly thr pro phe gly cys pro pro gly arg lys

397 GAG [illegible] CCA TCT GAT CCC GTC GAG TGG ACC GTG ATG
glu lys pro ser asp pro val glu trp thr val met

433 GAT GTC GTC GAA TAT TTT ACT GAG GCT GGA TTC CCG
asp val val glu tyr phe thr glu ala gly phe pro

469 GAG CAG GCG ACA GCT TTC CAA GAG CAG GAA ATT GAT
gln glu gln glu ile asp



FIG. 16A 






505 GGC AAA TCT TTG CTG CTC ATG CAG CGC ACA GAT GTG
gly lys ser leu leu leu met gln arg thr asp val

541 CTC ACC GGC CTG TCC ATC CGC CTC GGG CCA GCC CTG
leu thr gly leu ser ile arg leu gly pro ala leu

577 AAA ATC TAC GAG CAC CAC ATC AAG GTG CTT CAG CAA
lys ile tyr glu his his ile lys val leu gln gln

613 GGC CAC TTT GAG GAT GAT GAC CCC GAT GGC TTC TTA
gly his phe glu asp asp asp pro asp gly phe leu

649 GGC TGA GCG CCC AGC CTC ACC CCT GCC CCA GCC CAT
gly

685 TCC GGC CCC CAT CTC ACC CAA GAT CCC CCA GAG TCC
721 AGG AGC TGG ACG GGG ACA CCC TCA GCC CTC ATA ACA
757 GAT TCC AAG GAG AGG GCA CCC TCT TGT CCT TAT CTT
793 TGC CCC TTG TNT CTG TCT CAC ACA CAT CTG CTC CTC
829 AGC ACG TCG GTG TGG GGA GGG GAT TGC TCC TTA AAC
865 CCC AGG TGG CTG ACC CTC CCC ACC CAG TCC AGG ACA
901 TTT TAG GAA AAA AAA AAT GAA ATG TGG GGG GCT TCT
937 CAT CTC CCC AAG ATC CTC TTC CGT TCA GCC AGA TGT
973 TTC CTG TAT AAA TGT TTG GAT CTG CCT GTT TAT TTT
1009 GGT GGG TGG TCT TTC CTC CCT CCC CTA CCA CCC ATG
1045 CCC CCC TTC TCA GTC TGC CCC TGG CCT CCA GCC CCT
1081 AGG GGA CTA GCT GGG TTG GGG TTC CTC GGG CCT TTT
1117 CTC TCC TCC CTC TTT TCT TTC TGT TGA TTG TCG CTC
1153 CAG CTG GCT GTA TTG CTT TTT AAT ATT GCA CCG AAG
1189 GTT TTT TAA ATA AAA TTT TA
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1 CA AAA AGC AGC CCA GGA CAA CCG GAA GCA GGA CCC GAG GGA GCC
lys ser ser pro gly gln pro glu ala gly pro glu gly ala

45 CAG GAG CGG CCC AGC CAG GCG GCT CCT GCA GTA GAA GCA GAA GGT
gln glu arg pro ser gln ala ala pro ala val glu ala glu gly

90 CCC GGC AGC AGC CAG GCT CCT CGG AAG CCG GAG GGG GCT CAA GCC
pro gly ser ser gln ala pro arg lys pro glu gly ala gln ala

135 AGA ACG GCT CAG TCT GGG GCC CTT CGT GAT GTC TCT GAG GAG CTG
arg thr ala gln ser gly ala leu arg asp val ser glu glu leu

180 AGC CGC CAA CTG GAA GAC ATA CTG AGC ACA TAC TGT GTG GAC AAT
ser arg gln leu glu asp ile leu ser thr tyr cys val asp asn

225 AAC CAG GGG GGC CCC GGC GAG GAT GGG GCA CAG GGT GAG CCG GCT
asn gln gly gly pro gly glu asp gly ala gln gly glu pro ala

270 GAA CCC GAA GAT GCA GAG AAG TCC CGG ACC TAT GTG GCA AGG AAT
glu pro glu asp ala glu lys ser arg thr tyr val ala arg asn

315 GGG GAG CCT GAA CCA ACT CCA GTA GTC TAT GGA GAG AAG GAA CCC
gly glu pro glu pro thr pro val val tyr gly glu lys glu pro

360 TCC AAG GGG GAT CCA AAC ACA GAA GAG ATC CGG CAG AGT GAC GAG
ser lys gly asp pro asn thr glu glu ile arg gln ser asp glu

405 GTC GGA GAC CGA GAC CAT CGA AGG CCA CAG GAG AAG AAA AAA GCC
val gly asp arg asp his arg arg pro gln glu lys lys lys ala

450 AAG GGT TTG GGG AAG GAG ATC ACG TTG CTG ATG CAG ACA TTG AAT
lys gly leu gly lys glu ile thr leu leu met gln thr leu asn

495 ACT CTG AGT ACC CCA GAG GAG AAG CTG GCT GCT CTG TGC AAG AAG
thr leu ser thr pro glu glu lys leu ala ala leu cys lys lys

540 TAT GCT GAA CTG CTG GAG GAG CAC CGG AAT TCA CAG AAG CAG ATG
tyr ala glu leu leu glu glu his arg asn ser gln lys gln met

585 AAG CTC CTA CAG AAA AAG CAG AGC CAG CTG GTG CAA GAG AAG GAC
lys leu leu gln lys lys gln ser gln leu val gln glu lys asp

630 CAC CTG CGC GGT GAG CAC AGC AAG GCC GTC CTG GCC CGC AGC AAG
his leu arg gly glu his ser lys ala val leu ala arg ser lys

675 CTT GAG AGC CTA TGC CGT GAG CTG CAG CGG CAC AAC CGC TCC CTC
leu glu ser leu cys arg glu leu gln arg his asn arg ser leu
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thr glu lys phe glu glu phe gln asn thr leu
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1485 [illegible] GCA CCC AGC TCC CCC AGG GTC ACA GAA GCG CCT TGC TAC CCA
[illegible] ala pro ser ser pro arg val thr glu ala pro cys tyr pro

1530 [illegible] GCA CCG AGC ACA GAA GCA TCA GGC CAG ACT GGG CCT CAA GAG
[illegible] ala pro ser thr glu ala ser gly gln thr gly pro gln glu

pro thr ser ala arg ala *** 

- GGA G £ GAG CAG S S S S S S S S 55 S £ 

-J CTG S S £ A TGC AAT TXT GGA TTT TGT GGG TCA GTT TTA 

175 5 CGT ACA TAG GGC ATT TTG CAA GGC CTT GCA ^ ^ 

180 0 GTA AGT GTA CAG TGG GCT TGC ATT GGG GAT ^ 

l8 45 TGA AGT CAG TGG CTT GTC TGT GAG CTG AAG AGT ^ ^ 

1890 TGT CAT CTG TAG CTG CCA TCA CAG TGA GTT w, 

J' 5 AGC ATT TCT CTG TCT GAT TTG AGG CTC AGA CCC CTC CCT ^ 

1980 TCA GAG CTC AAA ACA AGT AAT ACA CCA AGG TCT ^ 

202 5 TCT TGT GAG CAG GGC TTG CTT GGT CAG CTC AGG ^ 

20 70 CTT GGA GGC TCC TTT GAT TCT CTA GAC CTG ^ ^ ^ 

|£ « AAG GGA S £ £ S °« GTO GGA TGA TGT G CT GGT 



^ u ~~ _ ^ „. TrT TTT TGC GTG GGA TGA TUT 

216 °, Sg Sg ecc cS £ SS £ £ £ tgc cct ttg gta gtg cca 

2205 rrl £ GGC CAA TGA TGC TTC TCA GTA GCC TTA TCA 

22 50 GGA CCA GGC CAA TG ^ ^ ^ ^ _ _ 

2295 GCC TCT CTA GCC TGC ACA AAi ttt 
ATT ATT TCT GAA GGT GTT TTT TTC TTT ATT ^ ^ ^ ^ ^ 

2385 TTT TTT CTT TTT CTT TTT Til ^ CTQ QGT 

2430 AGG ACC TTC CAA GGA AAA GGG ATG CTG TAC ^ ^ 

2475 GCC TGG CCT CCA GTG TCC CAC CTC CTT CAC ^ ^ 

2520 CTT TGC CAT CTT GAT GCT GAG GTT TCC TGT ^ 

2565 TGT TTG TGG TAA AAG AAA GGA AAG GGC TTC TGA ^ ^ ^ ^ 

26X0 AAG CTT ACC TGT GGG TTT CAG TCC T ^ ^ ^ ^ ^ ^ 

2655 CAT CAG CAC TGT CTw J. CTG CCC CCA CCC 

2700 CTG CCT CTT TGG CTT CAT GGG TTT TTC ^ ^ 

2745 CCA CAT GTG CAA TCC TCA AGA TTT GTC CTG ^ ^ ^ 

2790 ACC TCC CTG CCT GTC CTT GGG GAT TCT ACT ^ 

2835 CCC ATA GCT GTT GTC TAA CAG GTA AGA AAT GAA A ^ 

2880 ACT GGG CCC CAG AAA TCC ATA AAA TGG CTG CAG ^ 

2925 IgA GCC ATT CTA £ £ GAA TGT TCT Et GTT GCA AAC TTG 

2970 TGT AGA GCC ATT CTA TAT faU cgg qga caa 



;s s s s « » s S s s ss ss s S » oc T 

3M0 ^ £c « TTT TCT ™ AAC TAC TCA " TCC TCC AGG TCC CTT 
3105 T " 1^ If! I£ SI r.CC CAC TCT TTT AGC CCT TAC CTT MA TCT 



CAT CTT CCA TAC CTA CCC CAC TCT TTT ACC CCT £ ^ ™ j» 
71 « CTC AGA TAA GTT GGT TCA CAA AGA ATG TTA Afai * 
^ GTG ACT GAG ACC AGA GAT GGC AAA TGA ATG GCA CAC CAT TTC TCC 
TTC TCC TGC CCC AGG GCA GGT ACC ACT GAT CTG CAT CAG AG^ ^ 



3195 ere AGA TAA GTT -J V ." . ," ^ _ ^ 3CJ - CAT TTC TCC 

3285 TTC TCC TGC cue ow* — C CT CAA GCA 

3330 CTG CTA TTC TCT GGT GTA TCC TTC ACA TCT ^ 

33 75 GCT GTG TGA GTG TTG AGA TCT CTG CCA TCT CTG 

420 CTG TCC TGT GAA GTG TTT CCC ATG ACC TTT TTC TT 

3465 TCC CTC TTG TCT GGA GTA GTC CTT GCC TTC TTC in* 
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TCT 


CCA 


GCT 


GCT 


GCC 


CCA 


4455 
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TGG 
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CCC 


CAG 


GGA 


CAG 


4500 


TCC 
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GGG 
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GGA 


GCT 


GCC 
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TAT 
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ACC 


CGG 


CCA 


CTC 
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AGC 
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GTC 


TCC 


TTT 


GGG 


4635 


TTT 


TGA 


TGT 


ATT 


TTA 


AAC 


CAC 


4680 


AAA 


AAA 


AAA 


AAA 


AAA 


AAA 





TTG TGC CAG GCT AAG CTG GTA CAA 
TTG CTA GGC GAG AGA GGT GCA GGG 
TCC CCC TTG AAG AGA GGG GAA AGG 
GCC TGA CTC ACA CTT CAG CTA CCT 
GGA GCT GCT GGA TCC CAG TGT GGT 
CAG GTG GCC CCA GCT GGG TTT CCC 
GGC AAG GTG CAG CCT TTG CTC ACA 
TCA GGC CAG TCT GCA GAT GAG GTT 
ATT GAC CAA ATC AAC CAA TCA CTA 
CCA AAG TAG CCC AGG TCC TGG GCC 
CCA TGA GTG AAG GCC AGT GTC TTC 
TTG TGA CCC TCA GTT TTA GGA CCC 
GAT TGC TAG CTT TTC CTC CAG GGG 
GAG CGC ATG GCT CTG CTA ATA GTA 
CAG CTG AGA GCT TCA TGT CCA CCA 
CAC TTT TTT TTT TTA TTT GTT GTT 
ACC ATG GGC TGA GCT CAG GCA CTT
TGT AAA GAT GGT TAT TTA ACC CTC 
CTG AGG GCT GAC CCG GAG GCC AGT 
GGG AGG GCC AAG GCC TGC TGA GCT 
GCC TTT CCG CCT TGC ACA GCA CAG 
CCA GGC ACC TGC TCC TCT TGC CCT 
TTC TGT CCC TGT AAC TGC TTT CCT 
AGA CTT GTT TGA AGC TGC ACT GGC 
TAT TCA CAA CAG CCA GGG ACT TGA 
ATT AAA TAA AGA GTC TGT TGC CTT 
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GTG GAC GTG GAT GAG TAC GAC GAG AAC AAG TTC GTG 
val asp val asp glu tyr asp glu asn lys phe val 

GAC GAG GAA GAC GGC GGC GAC GGC 
asp glu glu asp gly gly asp gly 
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Rabbit MSKNTVSSAR FRKVDVDEYD ENKFVDEEDG GDGQAGPDEG EVDSCLRQGN 
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100 

51 

Rabbit MTAALQAALK NPPINTRSQA VKDRAGSIVL KVLISFKAGD IEKAVQSLDR 
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Human *• 

101 150
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1 50 
Rabbit EERVLEKEEE EEEEEDDEDD DDDVVSEGSE VPESDRPAGA QHHQLNGGER
Human DDD . . EDEE

51 100

Rabbit GPQTAKERAK EWSLCGPHPG QEEGRGPAAG SGTRQVFSMA ALSKEGGSAS
Human . . .S V. ..TP Q. .D P .MN....T..

101 150
Rabbit STTGPDSPSP VPLPPGKPAL PGADGTPFGC PAGRKEKPAD PVEWTVMDVV
Human VA P S

151 200
Rabbit EYFTEAGFPE QATAFQEQEI DGKSLLLMQR TDVLTGLSIR LGPALKIYEH
Human

201 220
Rabbit HIKVLQQGHF EDDDPEGFLG

Human D

FIG. 20
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1 AAGCTTTATAAAGATTTAACTACCTAATAAGGTAGAGAAGTAATTTATGTGCCCACTAAA 60 

61 AAATACTCAATTTCTGAATGTTCGTCCAAAATTAACTTGTCAGATCATTAAATCATTGAC 12 0 

121 TAGAAACACGTTGAGTACCTATTATGTACTAGGCACTTAGATCATTGTGAGACAATAAAA 18 0 

181 AATACTGCATTAGAAAAGGACATTTTTCACATCTTAAATGCAATAAGCATTATTTGGCTG 24 0 

241 GCAGTTAATTACATTTAACACATTAAACATATAGAGCAAAATTCTGAGCAATCAAAATAA 300 

301 TTATACCCTTGAGCAATCGATTATTTAAATTTCTTTCACTATTCCCTTAAGCTGATTTCT 360 

361 ACTCTGGGATTCTTTCATAGTTCTCAAATAAGAAAATAAAAAATTTCCTAAATAAGGCAA 4 2 0 

4 21 TACAAAAGAATAGAAATGTAAGAGAAGAGATATATTAGCTCTTGAATCCCTGTTTCCATT 4 8 0 

4 81 TGCTGTCAATAGTGCCTCTAATGTTCGATTTTCTCTTCAAAGAAAAATCTTGATTTAAAA 54 0 

541 GGAAGAAAAAGTACAATCACCTTTAACAGCTAAAGTATACTGATTAGCATCTACTAAAGT 600 

601 TAGCAAAGACTGAAACTGAAAAAAAATTGTAAAATCTTTATTCTAAGTTATATAACGCCA 660 

661 TTCACCATAGTAATGATTTTATACTTTGGTATATGGCTTTTTAAAATAAATATTGCCAAC 72 0 

721 AGGTAAAAATTTTTCCTTTGCTGTCTTAAGGCATTCCTAAGAGAATTTTTACCAGTGTGT 7 8 0 

7 81 GTTCATAACTTGAATGTTAATTTAAACAATGTTACTTCTATCACCTAAATGATATACTTA 8 4 0 

841 T AG AAG AG T G G T T T AAT T GGG AAC AG AAAAAC AC C AC AT TGCTTCTTC C C AAG AAAAAG G 900 

901 GATGTATTCCATTCTCGAGGTCTCTCTCCCACTCTCTATTTATATATAATATACTGCATA 960 

961 GATAAATATACACACATTATATATGTATTTTTTTGAACTTAAAGAAGACTGGACATATGT 102 0 

1021 ATTTACATGTATATATCCAACAAATATTTAATTTTGAGATCTCTCTCCCTCTTCTGATTT 10 8 0 

1081 ATTATTCTCAGTATGAATTCTCAAACTGTACGGTCTTTCACATTTCATTCATTCATCAAG 114 0 

1141 CATGTATCGAGTCCCTTCTGCATGCTTAGCTTTTTGTCATATGGAAGGAAGATACAAAAG 12 00 

1201 AAAAACTGTTTCTGCCCTTCAGAATCTTTCCATCTCTTCTAGGAAGGAGATAAAACACCA 12 60 

12 61 TATATCATTAAGAAATTTATAAGACTAGTCCCAAAACCAATGGTACAAGCAACATGCATT 132 0 

1321 TTACATTTATGTAGAATTTTAGAGCTTGGAAACACTTTCGTGATATATAATCCTAAGAAC 138 0 

1381 AATCTTGTAAAGTGCACATTATTAGCTCCATTTCAGTGATGAGGAATCTGAGACAGAATT 14 4 0 

14 41 TTAAGTGACATGTCTCGTTCAAACATTATGAGTGGAAGAGTCAACACTTAAGCCTGAGTT 15 0 0 

1501 TTCTGATTCTAAGCCTAGTGCTCTTTTCAACACAGCACTGGAAACCAAAGATTGTGGTAC 15 60 

1561 ACAACAAGGCAACAGCCAGTCTTCTTGCTCGAGGTCCAACTAAACTGGACCCATACCGAG 162 0 

1621 CAGTGTCCAGCCAAATGTCCAAATTAATTTTATCCTGCAAATATTTGTTCTTCAGTGTAA 168 0 

1681 TACACACAGCACAACTACCATTTCCTTCGTCTTAGTGCCTTTATCTCCTACATTCCAGAA 17 4 0 

17 41 ATGGGGATGTCAAATATTTTTTTAAATCTGGCCTAGATGGAATCATATAAATCTCAAATC 18 0 0 

1801 ATAATATAAATCTTAAAGGTCTGGTTTCCACCAATCCTTCCACATTTTGTTTTCCCCCAG 18 6 0 

1861 CACTAGAGAGCCTAACCTACCCTCACCCCTTTCGAGCATTCTTGCTCCAAACGACCACCT 192 0 

1921 ATTTTAAGATGTCAATGACCCTTTCCCAAATTCTACAAATTCACCCCAGTTTTGCCACCC 198 0 

1981 GACCCCAGCGCCTGCCCGGACACGTTCCCCTCCCTCCCAATAGATTTGATACCGAGTTCA 20 4 0 

2041 GGTTCTGCAGATCCCGTTGCGATGCTGTCACACAGCACTGACAGATAAGATTTGACCTTT 210 0 

2101 CGACTCCGTCCTTGGGGACTTCCCGCTGGCCAAGAAGGGTAGTTCCAATCCCAGGAAACG 2160 

2161 GGCTTCCTGCTCAGGAACGCAGCCTCTAGCAGCGCACAGTCTGAGGCAATGTCTCCGGCA 2220

2221 ATTAGAACGATGCTGGGCGCCCGGGTGTGCATCACTCTGCCTCATACTCCTACCAACTGC 2280

2281 AGGGCACTCGGTCCGGCAGCCAGTCCATCCCACCCACACCCAAGTCCCAGCCAGCCGGAC 2340

2341 CTTACGCAGGACCCCGATGATAGGTCGTTGACGGCTGCAGCAAAAGCCAAGGCCACCTGC 2400

2401 CGCTGCTGCCCATCCCCGCCAATCTGAGACCCCCTAGACTGGACCGCAGAAAAGCGTTTC 2460

2461 TATGGGAACCCCCCCACCGAGAATCACGTGACGCAATCGGACGACCAATCGCTTCTTACC 2520

2521 TCTGCCCGCGGTCCAGCTTTTGGCCCTCCCTCTCGCCCCCGCCTCCTTCGCCCAGCCCCG 2580

2581 CCCCTTGCCTGCGGAGAGCCCGCGCCTGCGCGCTGTGTCCTGCGCGCTCCTTCCCTCGCG 2640

2641 CGCGCTCTCCGTGGAAGAGCAGGGGCAGCGTGGGAGGCGCCAAGGGAGCGCGAACCTGAG 2700

2701 GAGGAAGAAACGGGGCTAGCGCGCAGGCCCAGAACGGTCCGAGCCGCGGCAGTCGGCGAC 2760

2761 GCCTCAGAGCGGAAGAGGGAAGTGAATCAGGCGCCGGGTAGTGGGTTGCTGGGCTGGGCT 2820

2821 TGCTGAGGTAGAGGCAGCGCCAAGAAGAGGCCTTTGCCGCTGGTCGGGATTGGGATGTCG 2880

M S 

2881 AAGAACACAGTGTCGTCGGCCCGCTTCCGGAAGGTGGACGTGGATGAATATGACGAGAAC 2940
KNTVSSARFRKVDVDEYDEN
2941 AAGTTCGTGGACGAAGAAGATGGGGGCGACGGCCAGGCCGGGCCCGACGAGGGCGAGGTG 3000
KFVDEEDGGDGQAGPDEGEV 

3001 GACTCCTGCCTGCGGCAATATCCTTGCATTCACCGCCCTCCCCACCCCAGCCCAGCCCAG 3060
D S C L R Q 

3061 CCCGCCCTTCTCCTGGGACCCGGGAGCCTGCAGGATCCGCGGGGCACCGGCGCGGAGCTG 3120 

3121 CCTCTCAACCTGCGGCTTAACCTGTCTCTTTGGGATCGCCCGCTCTGAGAGGGCAAGGGG 3180

3181 GAAGCCCCCGTTTCCTACCCAGTCGGCAGGAGACGCGAGGGTCCCACTCTTGGAAGCCTG 3240

3241 CCCTACCCCGCGCGCCTTCCACGCCCCCAGATTCCTCAGGTTGCACCCGAGTGCCTGCCT 3300

3301 GCCTCGGGAACTGGTCCCGCCGCCCGCGCCCTCGCGGCGCTGGGGAAGGCGGCCCCGGCT 3360

3361 GGTGGGGAAGGCTGGTGCCGACCGCCTTAGTTTTTCTTCCTAGAACTCTGATTTCCTGGG 3420

3421 GTCACATTAGCTCCAGAAATTTCTGATTGTGGGGAACCTGCATCTTTCCTTAGTGGTTTT 3480

3481 GTTTTTTGGTTGTGTTTTTGTTATTGGTAGCGTTAAGGTAGTTTATTGCTTACCGGGGGG 3540

3541 CCGGGGGAGATGGGACTGTTCGAAAATTGAGGGTCCCTGTGCTTTCAGCCCATTGGCCTT 3600

3601 TTTAAAAAAAAAAAAAAAAGAAGAAGAAGAAGGGGATTTGGCAAAATATACATTGTACAG 3660

3661 AATTTGTTAACTGGGGGAGGGGAATGAATACAAAAAATACAAAACTCCTAGAAGGAAGCT 3720

3721 TGGAGCCTTTTACCTGCTAAGAAAAGGACAATAGAAAAAACAACGGGGAATGCGTGTGGA 3780

3781 GAATCCTTGGAAATATTTAAAATAAACCCCAATGAATAAGATAGAAGATGAGTCATTCGT 3840

3841 ATAAAGCAGAATCATTTTTGTAATCCTAAAATTGTTTCCATTTTAGTTAAAATATGGCAG 3900

3901 TCAGTTCCCGGTTTCTGTTTTTGCATATTTGAATATTCATAACTTTGGCTTCGCATTTGC 3960

3961 ATTACATCTTTTTTAGAAAAATGTAAATGTTGCAAAAAAACCGAAGCTGTAGTTTTAGAA 4020

4021 AATCTCAGACACTGAATTTGTATGCATTTCTAATTCTTGGGTGTATTCATAAGGAAGACT 4080

4081 CTCAACAATGTCCTGTTATAGTGGGGAAATATGAGAGTGAAAATATTTAATGGCAACAAT 4140

4141 ATCCTTTTTTAAAGGCACCTAAATAGAGCATTAGACATTTATCAATATATAGATAGTGCT 4200

4201 TTGCCCAACTTTCACAATTAATTAGCTGTTGCTCTTTTGCATTATTTAAATACTTAAGTG 4260

4261 CTTGGAGTTATAAAAAATGAGCTAATCTACATCAGGCATGCTTCTCTAGAAATCCCTGCA 4320

4321 GCCTTGAAAATAACAGCTTGTCAACCAGAGATTTTGTGTAAGAACTTTTTCTTTAGAAAA 4380

4381 TAAATGGTGAACATGCTTCCTAAAAACATTATTTGTGATGGGATAAGATGGTGTTTTATG 4440

4441 AAACCCCAGTGTATTTTAGGTAATTTGTGGTGACTTTTAAAAGGTACTGCTGTATCCATA 4500

4501 TCAGTGGATCTGCTTTTTGATCAGTTCATCTTAAAATATAAAGATACTGTCTCTTCTTAC 4560

4561 CGTTACATACAGCCAGGAAAGACAGCCCTAGTGGTGGGGTACTAGAGTTGGAGGAACAAG 4620

4621 TGAACTCTGTGGTTTTCCTTTTAGGGGAATGTTTGTACATTCTGACAGTCTGATTGGCCT 4680

4681 TCTGTTTCTCATGCTTGCTAACTCACTAGTGCTTTCAAAGAGAGCCTGAATTTAATAGGT 4740

4741 ATGGTCTAACACAGTTTGAATAACCTTTGTGAAATATGAGAGAAAATATCTAAAGCAAAA 4800

4801 AATTAAGCTGCCACCTAAGGGACATATGAATTATTACATCTTCTGTGATGCCTCTTTTCA 4860

4861 TCAATATTGAGAGATTGCTAATGTGTATCATTCAGATTGCTAATCTGCCAGCATGTTCTA 4920

4921 CCAGCATTTCAGATAATACAGAATATGGTTCTAGCAAAAGTTTGGTCTTTATTTTTTCAA 4980

4981 TTAGAATCACAGGAAAAGACATATTTTGGTTGATAATAGGTTATTTCATTTGGGGGACTA 5040

5041 ATAATTCTGATATATATTTTAGGATTTCTTTAACACCACTCTAGGTAATGTTTGCATATG 5100

5101 TATCTCACTGGGAAATGAAAGACTATCAAGGTGTTCACTTGATAGTTAGAACCAAGGGTG 5160

5161 AAACAGTCTTTGCTTTATTAAAAAAAAGTCTAATGTTCTATTTTGCTTTTGATATTTTGC 5220

5221 CTTTGATTAACATCCTGGAAACCAACACATTGAATTTCCAGTATTGAACATAGTGACCAA 5280

5281 AGTAATTTTCTTTTTATATGTAAATCAAGTCATAAAGAACCAGTGGTTATAATGCTTTCT 5340



5341 GGGGGCCATCCTTTGCTGTTACACCCTTAACTTCCATCACAGGAAACATGACAGCTGCCC 5400

G N M T A A L

5401 TACAGGCAGCTCTGAAGAACCCCCCTATCAACACCAAGAGTCAGGCAGTGAAGGTGAGTC 5460
QAALKNPPINTKSQAVK 
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5761 ^m^^arnTaTrAP^GGTTAAAAAGATAGTGGTTTGTGGTCCCAGCACCCAG 5880 

5821 TAGAAAGCTCAGACATATGAGAAGGTlAAAAa^fti^i^ 

IS iS=^== is 

6061 TAACTAATCTTTTGTTTATATTT^ACTGACTT^ 61 Q Q 

6121 TOATATmWGe^ 62 4 0 

6181 CTG(»TCAAAGAATGCACTTTTTTTTAA^i 1 1 63 Q() 

SSI S~SS™™Un = .360 

67 2 1 AAGCAGGCTTTTGCCAAAGAAAAAAGATCA^ 68 4 0 

7141 TTGGCCAGGCTGGTCTCGAACTCCTGACCTCAGGTGATCTGCTC 

720 1 GTGCTGGGATTACAGCTGTGGGC^CACCACA^ 7320 

7 261 taagatcaagacataaattta^tgttgto 7380 

7321 TGTTAGCAAATCCTCTCAGCTCTG^ 74 4 0 



7381 
7441 



7501 cgaaaggmtgctgttaaaj^^^ , 62 o 

sssssssssssssssssssss.^™ .... 



7561 
7621 
7681 
7741 
7801 



7861 ATATGTCTAAATGTTCTGTTATTTATTGACCCACCAGGACCGGGCAGGCAGCATTGTCTT 7920
7921 GAAGGTGCTCATCTCTTTTAAAGGTAATGATATAGAAAAGGCAGTTCAATCTCT 7980

7981 GAATGGTGTGGATCTCCTAATGAAGTATATTTATAAAGGATTTGAGAGCCCGTCTGACAA 8040

NGVDLLMKYIi* 
8041 TAGCAGTGCTATGTTACTGCAATGGCATGAAAAGGTAAGTTATGAATTATAAATCTATAT 8100

............. RG _ 22C 



^Stgagccaaacacaaaagaactaatattttataattttacttatatgaaataatc 
SgSggcaaa^caaagggacagaaagtccttagaggttac 

r^GGAATAGGGAGTTAGTGCTTAATAGGTACAGAGTTCCTCCTTGGAGTGGTAAAAAAG 
™ScAGATAGTGGT 

TA^G^TTAGGAGTTTGGAGGATCAAGAATTCTTAAATCATGTTTTTCTATTTTCATGT 

gSSSSaagtagatgctggtaca^ 

TTrArCTTTGGGTAAAAAACTGGATAACATGTAAATAGAACCAGTCATAAAAATATTGAG 
ArTTCCC^CTTTTACTCTAAAATGCACCAATTTGGGTAGTAATTTATATGGCACCCTAT^ 

SggScacSctgtt^ 

Bnrrr^TGG^GTATATAAATATCTTTATCATCCTCAATTTACAGATGAAAAGCTAGCT 
nnnrrn^rrArCGTAGTTCTAGCATAGCCTCAAGATTGGAGTGAACATTGATTAC 

T ^li^ISrAAATGCTTGCTATTATTCCTCTTTTGTCACTGAGGAAACTGAGGCATG 

Sg^SmSgttcaagcgtatctaatgcagctgttcacagcatctctcccag^ 

SrrArrPArCCATCAAAATGATATCTTTAAAATCAAAATTGCCCTTGTCAGTCACCTGCA 

rr acgcccggctaattttttgtat^ 
S?gSgScSgacctcgtga^ 

ATGGTCTTGATCTCCl^ui CAGATTTTTTAAAAC ATTATTTAGAAT 

^IS?S™?tattactgtaacctcagtgccaagaacagtacctggtgcataattg 
SSSgStgttttaccaaatatgagaattgaccttttattattctgtcaaca 




10880 TTTACATCCTGGTTTGTTTTTAGGCACTTGCTGCTGGAGGAGTAGGGTCCATTGTTCGTG 10939



10940 TCTTGACTGCAAGAAAAACTGTGTAGTCTGGCAGGAAGTGGATTATCTGCCTCGGGAGTG 10999

LTARKTV*
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11000 GGAATTGCTGGTACAAAGACCAAAACAACCAAATGCCACCGCTGCCCTGTGGGTAGCATC 11059 

11060 TGTTTCTCTCAGCTTTGCCTTCTTGCTTTTTCATATCTGTAAAGAAAAAAATTACATATC 11119 

11120 AGTTGTCCTTTAATGAAAATTGGGATAATATAGAAGAAATTGTGTTAAAATAGAAGTGTT 11179 

11180 TCATCCTTTCAAAACCATTTCAGTGATGTTTATACCAATCTGTATATAGTATAATTTACA 11239 

11240 TTCAAGTTTAATTGTGCAACTTTTAACCCCTGTTGGCTGGTTTTTTGTTCTGTTTTGTTT 11299

11300 TGTATTATTTTTAACTAATACTGAGAGATTTGGTCAGAATTTGAGGCCAGTTTCCTAGCT 11359 

11360 CATTGCTAGTCAGGGAAATGATATTTATAAAAAATATGAGAGACTGGCAGCTATTAACAT 11419 

11420 TGCAAAACTGGACCATATTTCCCTTATTTAATAAGCAAAATATGTTTTTGGAATAAGTGG 11479

11480 TGGGTGAATACCACTGCTAAGTTATAGCTTTGTTTTTGCTTGCCTCCTGATTATCTGTAC 11539

11540 TGTGGGTTTAAGTATGCTACTTTCTCTCAGCATCCAATAATCATGGCCCCTCAATTTATT 11599 

11600 TGTGGTCACCCAGGGTTCAGAGCAAGAAGTCTTGCTTTATACAAATGTATCCATAAAATA 11659 

11660 TCAGAGCTTGTTGGGCATGAACATCAAACTTTTGTTCCACTAATATGGCTCTGTTTGGAA 11719 

11720 AAAACTGCAAATCAGAAAGAATGATTTGCAGAAAGAAAGAAAAACTATGGTGTAATTTAA 11779 

11780 ACTCTGGGCAGCCTCTGAATGAAATGCTACTTTCTTTAGAAATATAATAGCTGCCTTAGA 11839

11840 CATTATGAGGTATACAACTAGTATTTAAGATACCATTTAATATGCCCCGTAAATGTCTTC 11899

11900 AGTGTTCTTCAGGGTAGTTGGGATCTCAAAAGATTTGGTTCAGATCCAAACAAATACACA 11959 

11960 TTCTGTGTTTTAGCTCAGTGTTTTCTAAAAAAAGAAACTGCCACACAGCAAAAAATTpTT 12019

12020 TACTTTGTTGGACAAACCAAATCAGTTCTCAAAAAATGACCGGTGCTTATAAAAAGTTAT 12079

12080 AAATATCGAGTAGCTCTAAAACAAACCACCTGACCAAGAGGGAAGTGAGCTTGTGCTTAG 12139

12140 TATTTACATTGGATGCCAGTTTTGTAATCACTGACTTATGTGCAAACTGGTGCAGAAATT 12199

12200 CTATAAACTCTTTGCTGTTTTTGATACCTGCTTTTTGTTTCATTTTGTTTTGTTTTGTAA 12259

12260 AAATGATAAAACTTCAGAAAATAAAATGTCAGTGTTGAATAATTTATTTTTCTCTGACAC 12319

12320 TTTAACAATTATGAATGTATGGTTAATTAAGAGGAAAGGTTTTCTGCTTCTACCACCAAG 12379

12380 TACTGTACTCTTAACAAGAACAGTTTGGTAGGGTTTTTATAAGACTATATAGATATAAGA 12439

12440 TGATAGAGAAGAGAGTCATGAATGATGTCAGAGCACTACTGAAGCCTTTGGAGTGATTCC 12499

12500 ATAGCCTTCTGGATGGCAGCTGAATACCTATATGTAGTATCACTGCCCAAAGACCTAGAC 12559

12560 TAGAAAGTGCAAAGTAGCTTAGCAGCTGCAGTCATTCACTCCCAGCCTCCAAAATTCTCT 12619
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1 - GATCCCTCTCCAGGTGGAAG - 60 

61 - CTCCCTTCATACCAAAGTTTAAAGGCCCTGGGGATACGAGTAACTTTGACGACTATGAGG - 120 
121 - AAGAAGAAATCCGGGTCTCCATCAATGAGAAGTGTGGCAAGGAGTTTTCTGAGTTTTAGG - 180 
181 - GGCATGCCTGTGCCCCCATGGGTTTTCTTTTTTCTTTTTTCTTTTTTTTGGTCGGGGGGG - 240
241 - TGGGAGGGTTGGATTGAACAGCCAGAGGGCCCCAGAGTTCCTTGCATCTAATTTCACCCC - 300 
301 - CACCCCACCCTCCAGGGTTAGGGGGAGCAGGAAGCCCAGATAATCAGAGGGACAGAAACA - 360 
361 - CCAGCTGCTCCCCCTCATCCCCTTCACCCTCCTGCCCCCTCTCCCACTTTTCCCTTCCTC - 420 
421 - TTTCCCCACAGCCCCCCAGCCCCTCAGCCCTCCCAGCCCACTTCTGCCTGTTTTAAACGA - 480
481 - GTTTCTCAACTCCAGTCAGACCAGGTCTTGCTGGTGTATCCAGGGACAGGGTATGGAAAG - 540
541 - AGGGGCTCACGCTTAACTCCAGCCCCCACCCACACCCCCATCCCACCCAACCACAGGCCC - 600 

Human cAMP-dependent protein kinase 
catalytic subunit alpha 
Accession number X07767 (until *) 
follow arrow until line that 
begins 1561 - 

601 - CACTTGCTAAGGGCAAATGAACGAAGCGCCAACCTTCCTTTCGGAGTAATCCTGCCTGGG - 660 
661 - AAGGAGAGATTTTTAGTGACATGTTCAGTGGGTTGCTTGCTAGAATTTTTTTAAAAAAAC - 720 
721 - AACAATTTAAAATCTTATTTAAGTTCCACCAGTGCCTCCCTCCCTCCTTCCTCTACTCGC - 780 
781 - ACCCCTCCCATGTCCCCCCATTCCTCAAATCCATTTTAAAGAGAAGCAGACTGACTTTGG - 840
841 - AAAGGGAGGCGCTGGGGTTTGAACCTCCCCGCTGCTAATCTCCCCTGGGCCCCTCCCCGG - 900 
901 - GGAATCCTCTCTGCCAATCCTGCGAGGGTCTAGGCCCCTTTAGGAAGCCTCCGCTCTCTT - 960 
961 - TTTCCCCAACAGACCTGTCTTCACCCTTGGGCTTTGAAAGCCAGACAAAGCAGCTGCCCC - 1020
1021 - TCTCCCTGCCAAAGAGGAGTCATCCCCCAAAAAGACAGAGGGGGAGCCCCAAGCCCAAGT - 1080
1081 - CTTTCCTCCCAGCAGCGTTTCCCCCCAACTCCTTAATTTTATTCTCCGCTAGATTTTAAC - 1140
1141 - GTCCAGCCTTCCCTCAGCTGAGTGGGGAGGGCATCCCTGCAAAAGGGAACAGAAGAGGCC - 1200
1201 - AAGTCCCCCCAAGCCACGGCCCGGGGTTCAAGGCTAGAGCTGCTGGGGAGGGGCTGCCTG - 1260 
1261 - TTTTACTCACCCACCAGCTTCCGCCTCCCCCATCCTGGGCGCCCCTCCTCCAGCTTAGCT - 1320 
1321 - GTCAGCTGTCCATCACCTCTCCCCCACTTTCTCATTTGTGCTTTTTTCTCTCGTAATAGA - 1380 
1381 - AAAGTGGGGAGCCGCTGGGGAGCCACCCCATTCATCCCCGTATTTCCCCCTCTCATAACT - 1440
1441 - TCTCCCCATCCCAGGAGGAGTTCTCAGGCCTGGGGTGGGGCCCCGGGTGGGTGCGGGGGC - 1500
1501 - GATTCAACCTGTGTGCTGCGAAGGACGAGACTTCCTCTTGAACAGTGTGCTGTTGTAAAC - 1560
1561 - ATATTTGAAAACTATTACCAATAAAGTTTTGTT * TAAAAAAAAAGTGTCGCTGGTGTTCTC - 1620
1621 - GACTTCGATCACCCACCCACACACCCCCAGGGGGTTGGAAAGGGAATTTCGGACCCCAGC - 1680
1681 - GTGCAGGCCGATCAGGTCCTGGCTTGAAGTCCTTGTAACCAGGGTTTAGCTGAAATTCCG - 1740
1741 - GCACTCCTTCGGCCCCGCAGGAGAAACGAGCGTCAAACTGCCCTTTGACCCCAGATTCGG - 1800
1801 - GGTCCCCAAATCTGCGGCGCGCCCCCTCGGCGTCCAGCCCGGGACCGAGAGGGCGCTCTA - 1860
1861 - GGGAGGCGCTGGGGCTGGCGCGCCAGGAGGCCGAGCGGCGGCGGGGGCGGCCCTGGCAGG - 1920
1921 - GGGAGTAGAAGGGGGAGAGGGTGCGCGCCCCCCTTCCCGCATCCTCAGCGCCGGGCCAGG - 1980
1981 - CGCGCCTGAGGGACGCGGGGGCGGCGGCAGCAGGAGGGTCCCCGCAGCACCCTGCGAGCG - 2040
2041 - CGGCAGCCCCGGCCCGCGGGCGGCGAGTTCCCGGTAAGTGCGGTCCCGAGAGCGGAGCGC - 2100
2101 - GCTGGAGAGGCGTGGAGAGGGGGGCTGGGCGCCGGGGACGTCTGGGTCCCGCGCCCAATG - 2160 
2161 - GCTGGAGGGCGGCCGAGCGCCGCCCGCCCGCCCTGCCCGCCCCCTCTCCCCTCCCCCCGG - 2220 
2221 - CACTCCCCTCCCCCTCCCCCGCCCGCCGCTTTCCCCCGCCCCCGCCCCGGCGCCAACTCC - 2280
2281 - GCGGCGCCTCCTTAAAAAGCGCGCGGGAGTTGTAAGGGGGGGCCGGAGCGAGCCGGAGTG - 2340
2341 - AGCGAGAGCGCAGGGTAAAGGGGGCGGGCGGGGGGCCCGGGCTCCACCTTAAAAGCGGGC - 2400
2401 - GCGTGGGGGTGGGAGGGAGGAAGGCGGGCGGCGGGGAGGAGGGAGGGAGGGAAGGAAGGG - 2460
2461 - GGGCCGGAGTGTCCCGGGCGCAGGGCGCGCGTGCGGCGGCGGCGGCGGCGGGGAGGGGCC - 2520
2521 - GGCCGCGCCGCGCTCCCCTCCTCCCCCTCGCATCCCCGGCCCCGCGCGCGCCCAGCAGAA - 2580
2581 - GCGGGTCTGTGTGTGCGTGCGTGCGAGTGAGTGAGTGTGTGCATATTTTTTTCTCTCTTT - 2640
2641 - TCTTTCTCTCTCACTGTTTTTTCCTCTCTCTCTCTCTCCCTCTCTCTCTCTTTTTTTTTT - 2700
2701 - TTTTTTTTTTGCAAAGAAACAGCAGCGCCGCCGCCGCTCCGCCGAGGCGCTGCGCCCCCC - 2760
2761 - GGGGGGGGAGGCGGAGGAGGCGGGCAGCGGCGGAGGGAGGGGAGCCGGGGAGGGGGGCGC - 2820
2821 - CGCGCTGGGAGGGAGGCAGCGCGCACGGTGCAGCCGGGCCGGGCGGGAGGCATGGCGGGG - 2880

MAG

2881 - CCCCCGGCCCTACCCCCGCCGGAGACGGCGGCGGCCGCCACCACGGCGGCCGCCGCCTCG - 2940
-PPALPPPETAAAATTAAAAS

2941 - TCGTCCGCCGCTTCCCCGCACTACCAAGAGTGGATCCTGGACACCATCGACTCGCTGCGC - 3000
-SSAASPHYQEWILDTIDSLR

3001 - TCGCGCAAGGCGCGGCCGGACCTGGAGCGCATCTGCCGGATGGTGCGGCGGCGGCACGGC - 3060
-SRKARPDLERICRMVRRRHG

3061 - CCGGAGCCGGAGCGCACGCGCGCCGAGCTCGAGAAACTGATCCAGCAGCGCGCCGTGCTC - 3120
-PEPERTRAELEKLIQQRAVL

3121 - CGGGTCAGCTACAAGGGGAGCATCTCGTACCGCAACGCGGCGCGCGTCCAGCCGCCCCGG - 3180
-RVSYKGSISYRNAARVQPPR

3181 - CGCGGAGCCACCCCGCCGGCCCCGCCGCGCGCCCCCCGCGGGGCCCCCGCCGCCGCCGCC - 3240
-RGATPPAPPRAPRGAPAAAA

3241 - GCCGCCGCGCCGCCGCCCACGCCCGCCCCGCCGCCACCGCCCGCGCCCGTCGCCGCCGCC - 3300
-AAAPPPTPAPPPPPAPVAAA

3301 - GCCCCGGCCCGGGCGCCCCGCGCGGCCGCCGCCGCCGCCACAGCGCCCCCCTCGCCTGGC - 3360
-APARAPRAAAAAATAPPSPG

3361 - CCCGCGCAGCCGGGCCCCCGCGCGCAGCGGGCCGCGCCCCTGGCCGCGCCGCCGCCCGCG - 3420
-PAQPGPRAQRAAPLAAPPPA

3421 - CCAGCCGCTCCCCCGGCGGTGGCGCCCCCGGCCGGCCCGCGCCGCGCCCCCCCGCCCGCC - 3480
-PAAPPAVAPPAGPRRAPPPA

3481 - GTCGCCGCCCGGGAGCCGCCGCTGCCGCCGCCGCCACAGCCGCCGGCGCCGCCACAGCAG - 3540
-VAAREPPLPPPPQPPAPPQQ

3541 - CAGCAGCCGCCGCCGCCGCAGCCACAGCCGCCGCCGGAGGGGGGCGCGGTGCGGGCCGGC - 3600
-QQPPPPQPQPPPEGGAVRAG

3601 - GGCGCGGCGCGGCCCGTGAGCCTGCGGGAAGTCGTGCGCTACCTCGGGGGCAGCGGCGGC - 3660
-GAARPVSLREVVRYLGGSGG

3661 - GCCGGCGGTCGCCTAACCCGCGGCCGCGTGCAGGGGCTGCTGGAGGAGGAGGCGGCGGCT - 3720
-AGGRLTRGRVQGLLEEEAAA

3721 - CGAGGCCGTCTGGAGCGCACCCGTCTCGGAGCGCTTGCGCTGCCCCGCGGGGACAGGCCC - 3780
-RGRLERTRLGALALPRGDRP

3781 - GGACGGGCGCCGCCGGCCGCCAGCGCCCGCCCGTCTCGCAGCAAGGTGAGCGCGCCGGGG - 3840
-GRAPPAASARPSRSK 



3841 - AGCGGGGGCGCCGCGCGGTGGGCAGGTGCGGGCGAAGTTGGTGGCGGGGGCGCGAGTCCC - 3900
3901 - GGGAGGAACTGGGTGGCGGGTGGCTGGGGCTTTGCGCGCGTTTCCTGCGGGCTCGGTGCG - 3960 
3961 - TGGTGACCTTGGCAAGTGATTGAATCTCCCGGAGCCTCAGTTTCCTCCGCTGTAAACGCG - 4020

4021 - GTTTAATAACAGTAGCGACCCCTTGGGGTTGTTGAGCGAGTTTAGTAAGATTTGGTTGTC - 4080

4081 - GAGGGCTTTAGTTAACACAGAGCCTGGCACGGAGTGAATGCGTAAAAGTTAGTCCGTATT - 4140

4141 - GTTCTTAAAGGTGGAATCGGTTCCTCCTCCCCACCGCCCGGACGCCACAGTCAGGGTCTG - 4200



4201 - GGATTAGAACAGCTACTAATTTTGCATGCTTCTCTCCTCGGCTCCAGAGAGGTGGAGAAG - 4260

- R G G E E 

4261 - AGCGAGTACTTGAGAAAGAAGAGGAAGAAGATGATGATGAAGATGAAGATGAAGAAGATG - 4320
RVLEKEEEEDDDEDEDEEDD 

4321 - ATGTGTCAGAGGGCTCTGAAGTGCCCGAGAGTGACCGTCCTGCAGGTGCCCAGCACCACC - 4380

- VSEGSEVPESDRPAGAQHH Q 

4381 - AGCTTAACGGCGAGCGGGGACCTCAGAGTGCCAAGGAGAGGGTCAAGGAGTGGACCCCCT - 4440
LNGERGPQSAKERVKEWTP/C 

4441 - GCGGACCGCACCAGGGCCAGGATGAAGGGCGGGGGCCAGCCCCGGGCAGCGGCACCCGCC - 4500
GPHQGQDEGRGPAPGSGTRQ 1 

1 - AGGTGTTCTCCATGGCAGCCATGAACAAGGAAGGGGGAACAGGTAAGGATCCCTCTGGGT - 60 
VFSMAAMNKEGGT 



61 - GGGGAAGAGTGCTAGGTGGAGAGGAACTCAGCCCGAAGACAAAGCCAAAGACAGGTGTTT - 120 



121 - TTTTCCTTCCCAGCTTCTGTTGCCACCGGGCCAGACTCCCCGTCCCCCGTGCCTTTGCCC - 180
ASVATGPDSPSPVPLP 

181 - CCAGGCAAACCAGCCCTACCTGGGGCCGACGGGACCCCCTTTGGCTGTCCGTAAGTTGGG - 240
-PGKPALPGADGTPFGCP 

241 - GTATTGGAGACATGGGGGTGCTGCTCAGGTGTGTGGTACAGCCAGAGAGACATCCGTGTT - 300

301 - CACTGGTGTCTGTTTGTTTTGATGCAGTCCCGGGCGCAAAGAGAAGCCATCTGATCCCGT - 360

PGRKEKPSDP V 

361 - CGAGTGGACCGTGATGGATGTCGTCGAATATTTTACTGAGGCTGGATTCCCGGAGCAGGC - 420
-EWTVMDVVEYFTEAGFPEQA 

421 - GACAGCTTTCCAAGAGCAGGTGAGTTTCCAGCCCAGGACTACACACTGACAGACACAGAG - 480

- T A F Q E Q 

481 - GGCCTCCCTGGGATGTGCCCTGATCCCGGCTTTCTCTGTTCCTGTCCCACCCAGGAAATT - 540

- EI 

541 - GATGGCAAATCTTTGCTGCTCATGCAGCGCACAGATGTGCTCACCGGCCTGTCCATCCGC - 600
-DGKSLLLMQRTDVLTGLSIR 

601 - CTCGGGCCAGCCCTGAAAATCTACGAGCACCACATCAAGGTGCTTCAGCAAGGCCACTTT - 660
-LGPALKIYEHHIKVLQQGHF 
661 - GAGGATGATGACCCCGATGGCTTCTTAGGCTGAGCGCCCAGCCTCACCCCTGCCCCAGCC - 720
-EDDDPDGFLG* 

721 - CATTCCGGCCCCCATCTCACCCAAGATCCCCCAGAGTCCAGGAGCTGGACGGGGACACCC - 780
781 - TCAGCCCTCATAACAGATTCCAAGGAGAGGGCACCCTCTTGTCCTTATCTTTGCCCCTTG - 840
841 - TGTCTGTCTCACACACATCTGCTCCTCAGCACGTCGGTGTGGGGAGGGGATTGCTCCTTA - 900
901 - AACCCCAGGTGGCTGACCCTCCCCACCCAGTCCAGGACATTTTAGGAAAAAAAAAATGAA - 960
961 - ATGTGGGGGGCTTCTCATCTCCCCAAGATCCTCTTCCGTTCAGCCAGATGTTTCCTGTAT - 1020
1021 - AAATGTTTGGATCTGCCTGTTTATTTTGGTGGGTGGTCTTTCCTCCCTCCCCTACCACCC - 1080
1081 - ATGCCCCCCTTCTCAGTCTGCCCCTGGCCTCCAGCCCCTAGGGGACTAGCTGGGTTGGGG - 1140
1141 - TTCCTCGGGCCTTTTCTCTCCTCCCTTTTTCTTTCTGTTGATTGTCGCTCCAGCTGGCTG - 1200
1201 - TATTGCTTTTTAATATTGCACCGAAGGTTTTTTAAATAAAATTTTAAAAAAAGAAAAAGG - 1260
1261 - GAAAAAAAAGCCACGGAGTCCATTTTATGAATGGGGTGGGGAGAGGGCACTAAAGAGCCT - 1320
1321 - CCTAAGAGAGCCTCAGGTTAGGACAGAATTGTTTGGGGAGGGAGAAAAACAGAAACAATG - 1380
1381 - AATTATAGCTGCCTCACAGCCATGTATAACAATAATTGCTCCAGGAAGGTGGGAATATTT - 1440
1441 - GCTTTTTTTTCTTCTGTAATCTCACCGTGTCCGTGTCCAGAACAGAGCTAGGCACACAGC - 1500
1501 - AGGTGCTCAATTTTTGTTTTTCGTTTAGACAGGTTTCATTCTTTCACCCAGGCTGGAGTG - 1560
1561 - CAGTGGTGCTATCATAGCTCATTGTAGCCTCAAACTCCTGGGCTGAAGTGATCCTCCCAC - 1620
1621 - CTCAGCCTCCTGAGTAGCTGGGACTACAGGTGCACTCTGCCATGCCGGGCTAACTTTTAA - 1680
1681 - AAATTTTTGTCCGGGCACAGTGGCTCATGCCTGTAATCCCAGCACTTTGGGAGGCCGAGG - 1740
1741 - TGGGTGGATCATGAGGTCAGGAGTTCAAGATCAGCCTGGCCAAGATGATGAAACCCTGTC - 1800
1801 - TCTACTAAAAATATAAAAAAAAATTAGCTGGGCGTGGTGGTGGGTGCCTGTAATCCTAGC - 1860
1861 - TATTCAGGAGGCTGAGGCAGAGGATTGCTTACACCTGGGAGGCGGAGGGTGCAGTGAGCC - 1920
1921 - AAGATCGTGCCACTGCACTCCAGCCTGGGTGACAAAGTGAGACTCTGTCTCAAAAAAAAA - 1980
1981 - TCTTTGTGTGTGTGTGGAGATGAGGGTATGCACTTTGTTGGCCAGGTTGGCCTCGAACTC - 2040
2041 - CCAGCCAAGCAATTCTGCCTGGGATTACAAGCGTGAGCCACCATGCCTGGCCTCAAATAT - 2100
2101 - TGTTGAATGGCTAGCAGTTAAGTCCTTGGGTTTATAAGCATTTCCTCAACTGTCCTCCCA - 2160
2161 - AGTCCCCATAAGACAAAAAACTCATAAAATCCCACCTTACAGAAGAGGCAGCTGGCCCGG - 2220
2221 - CACAGAGATGCTGTCTGCCCCGGGTCACACAGGGTGGCATCTGACACCCTGTCTGAGTTC - 2280
2281 - TTCACTCAGAGTCTTTAAATATAATTAGCGTATTTGACATAATGTACATTAAAAACTATA - 2340
2341 - AACCTGTCAGCCTTTGTCTACTGCAAAGAATCCACTACAAATATTGGGGCAGGGATCTGT - 2400
2401 - TCTTGGACCATAGTAGTGTCTCCAGACCTCATGGTCCTCTTCATTAAAACAACAGAAAAT - 2460
2461 - TCCTTCTGGGCCATCAGATGAGACCATGAGATAGAAGATTTCCAAGTGAAGATTTTGTTT - 2520
2521 - CAAGACAGAGTCTTGCTCTGTCACTCAGGCTAGAGTGTACTGGTGCAATCATAACTGTGG - 2580
2581 - TGACAGCCTCGAACTTTTGGGTACAAGTGATTCTCATGCCTCAGACAACACCCAACTAAT - 2640
2641 - ATTTTGGTTTTTGTATAGACAGGGTCTTGCTATGTGGCTTAGGCTGGTCTTGAACTCCTG - 2700
2701 - GCCTCAAGCAGTCCTCCCGCTTCAGCCTCCTAAAGTGTCAGGATTACAGACATGAGCCAC - 2760
2761 - CAAGTCCAGCCTGAAGATTTTTAAAAATTATTGTTAGTAGTAGTCGCCAGAGTTACTACA - 2820
2821 - TCCAAAGTCCCTACTAAGTTCTAAGTAGTCCCTACTAAGTTCTAAGGCAGTTTCTCAACT - 2880
2881 - CATTAGAGTTGTTTTTTGTTTTTAAAGAAAAAAAGAGGCTGGGCACTTTAGGAGACCGAC - 2940
2941 - ACGGGAGGATCGCTTGAGTCCAGGAGTTTGAGACCAACCTGGGCAACATGGGCCCCCATC - 3000
3001 - TCTAAAAATTTTAAATTAAAAAAATGTTTTAACAACAAAAAGCGTTCTGGGAGTGAGGGG - 3060
3061 - CTGGGGCCTGGGCGGCCTCATTCCATATACCTGTGCCGGGTTGAGGGGTTGGAGACACGT - 3120
3121 - TTAGAGACCCCTCCACTCTAGGAATCCACCTCGAGAGATAAAGGTCCCGGCCCTAGCCAC - 3180
3181 - ACCCCCAGGACACGGCCAGAGGCCACCTCCCTAGGCGGGTCCCTCCCCACCGCCAGGTTC - 3240
3241 - CTGGAGCGCGTGCGGCGCGTGTGCAGGGGTAGGGGGCCGCAGGCGCGCGGACTGGAGAGG - 3300
3301 - CGCGCCCCTCCCGCGTGTTGAAATTCAAAAGAGGCGAACGGCCCCCGGCGCGGCGGCGCG - 3360
3361 - GCTCCGGTGGAGAGGTCAAGGCAGGGGCCAGTCGGAGGCTCCCGGGGCGGGGTCGAACCC - 3420
3421 - GCGGCCAACCTGAGCAGCAGCGGAAGCTTAAAGAGCTCAGGTTCCCGCCCCCCGGCCCTA - 3480
3481 - CCATGGCTACAGAGCAGTGGTTCGAGGGGTCGCTCCCCCTGGACCCTGGAGAAACACCGC - 3540
3541 - CTCCAGACGCCTTGGAACCTGGGACGCCGCCCTGCGGAGACCCCTCCAGGTCGACGCCCC - 3600
3601 - CTGGCAGGCCTGGGAACCCATCTGAGCCGGATCCTGAAGATGCCGAGGGGCGGCTGGCTG - 3660
3661 - AGGCCCGGGCCTCCACGTCTTCCCCCAAACCTCTGGTCCCCCGGCCTGGGCCAGCACCTC - 3720
3721 - CCCGCCTATCCCTGGACACTTTGTTCAGCCCCATCACCCAACAGCTGCGCTACCTACTGA - 3780
3781 - AGAAGGCAGATGATTTCCAGAGCTACTTGCTCTACAGGTGATGCTGGACAGGGTCCCAGG - 3840
3841 - TCCCCATGGGTAAGGAGACTTGGAGGGGAGGCGACAGGATGGGTGACACACACCAGGGTC - 3900
3901 - GCAAAATTACAAGCGCTAGGAGCCAGAGGGAGACAGTGGAAGAAGCTAGCATATTAGAAT - 3960
3961 - CCAGTTTAAGAGAATGAGGAAGACTGTAGAATTGCGGGTAGGGGATGGCTGCTATTACTG - 4020
4021 - TCGTGGCAGGGTGGGCCTGGGGTTGTCAAGTCTCTAGGACTTTTTCTCCCAGTTTTTAAG - 4080
4081 - TGCTGTCTTACATTTTGAGCCCTGTGCTGGCTAAACAAGACCCACCTGAGCCAAACTTGG - 4140
4141 - CCTGCAGGACATCAGTTTGAGACTCCAAAGGATAATGTGATTCCCAGACCAGGTTTCCCT - 4200
4201 - GTGACTCTCAATTTCAGTGTCCATTGGAATTTCCTAGGAGGCTGGGTTGGGTTTGTTTGC - 4260
4261 - GTGTTTGTTTTTGAGATGGAGTCTCACTCTGTCGCCCAGGCTGGAGTGCAGTGGTGCAAT - 4320
4321 - CTCAGCTCACTGCAACCTCCGCCTCCCGGATTGAAGCAATTCTCTGCCTCAGCCTCCCGA - 4380
4381 - GTAGCTGGGATTACAGGCGCCCACCAACATGTGTTGCCCGGCTAATTTTTTTCTTTTCTT - 4440
4441 - AGTAGAGACAGAGTTTCACCATCTTGGCCAGACTGGTCTTGAGCTCCTGACCTCATGATC - 4500
4501 - CACCCGCCTTGGCCTCCCAAAGTGCTGGAATTACAGACGTGAGCCACCGCGCCTACCCGA - 4560
4561 - GGCTGGGTTTTTTTGTTTTGTTTTGTTGTTATGTGTTTTTTTGAAATGGAGTCTTGCTCT - 4620
4621 - GTCACCTAGGCTGGAGTGCAGTGGGGCGAACTCAGCTCACTGCAACCTCCGCCTCCCAGG - 4680
4681 - TTCGAGGGATTCTCATGAGGCTGTTTTTTTTTTTTTAATGAGACAGGGTCTCGCTCTGTC - 4740
4741 - ACCCAAGCTGGAGTGCAAGTGGGGCAGTCATAGCTCACTGCACCCTCGAACTCCTGGTCT - 4800
4801 - CAAGCAATCTTCCACCTCCCCTCCTGGGTAACTGGGACTACAGGTGCCACCATGCCCAGC - 4860
4861 - TAATTATTTTTGTGTAGAGATGGGTTCTTGCTATGTTGCCTAGGCTTGTCTGGAACTCCT - 4920
4921 - GGCCTCAAGCAATCCTCCAGCCTCAGCCTCCCAAAACTCTAGGATTGCAGGCGTGAGCCA - 4980
4981 - CTGTGCCCAGACCCTGCAGGAAGCTCTGGGTCCTAAGTGTTGTGACACTCAGGTGTCAGC - 5040
5041 - ACTTTAACAAGTGTTCCAAATGGGTTTGATGCAGGTAAACCAGAAAGATGTTCAGAAAAG - 5100
5101 - ACCTGAAACTGGGGGCTTTTCTAATGGGTCAAAGCCAGGGATACAGGTTGGGATTGAGTA - 5160
5161 - GAATGGGGAAAACTGCGGGGTGGGGAGGGGTTGTGAGGGATTCCAGGCAAAGGCCCCCTT - 5220
5221 - CTTCCTTCAGCAGAGACCAAGTACAGAAGGAGCAGCTGGCCAAGGCCATGCCCACCTTCT - 5280
5281 - TACAGATGTGTGAGCCCTACTTCCTGTACCTGGAGGCAGCCGCGAGAAGCATACCCCCCA - 5340
5341 - TCTATGGACCCCTGCAGGAGCTGGTCCGAAAGGGGGTGTGTGGAGGTTTCTTAGACCCCA - 5400
5401 - CGCCCCTTTCTTCTCGCAGCTCTGAGCCTGTGGGGATGGTGGAGGGGGAGGCCCACTCCT - 5460
5461 - CGCAGGCCAGCTGATCTCACTGTACCCCCCTCTTGTATGCAGCTGTTAGAGATCTCCCAA - 5520
5521 - CAGCTGACCCTGCGCCTGGAACAGCTGGTCCTCATGTACGCTTCCTTTGGGTTCGTGGAC - 5580
5581 - CTGGAGGAGATGAACCCCCTTAGGTAAAATGGTAGGAGACTCAGATGGGGGGATGAAGGA - 5640
5641 - GTCCAAGGCCCAGCCTCACCCCTCCATTCTCTCATGTCTCGCCAGCATCTCCTGTTTCTT - 5700
5701 - TTGCGGGAGGTTCTCCATCAGCCTGTCCCATGAGGTCTCCATCTTCAGATACTGTGCCCC - 5760
5761 - AACCGCCTACACTGCCAGCCGCTTCCCCCGCTACCTCTATAAGAAGATGCGCTGGCACCT - 5820
5821 - GGAAGCCACCCCAGAGGCCCCTGGTCGGGGACAAGATTCCCTTGTGGATTAGTAAGTCCT - 5880
5881 - CTTACCCAAATCAAAGTCCTCCCCTTTCTATGATGAATGCCAATATGACCCTCCAAACCG - 5940
5941 - TCACCAGCAAAGTGAAAAGTGAGCCAGGGCCCGAGGCAGTGGCTCACGCCTGTAATCCCA - 6000
6001 - ACACTTTGGGAGGCCGAGGCAGGAGGATCACTTGAGCTCAAGAGTTTGAGATCAGCCTGG - 6060
6061 - GCAAGATGGCAAGACCCTGTCTCAACAACAAAGAAATTCGCCAGGCGTGATGGCTGGCAC - 6120
6121 - CTGTAGTCCCAGCTACTTGGGAGGCTTAGGCAGGAGGAGCACTTGAGCCCAGGAATCAAG - 6180
6181 - GCTACGGTGAGCTGTGATTGTGCCACTGCACTCCACCCTGAGTGGAAGCAATAATCTGTC - 6240
6241 - TCTTAAAAAAAAAAAAAAGTGAACCAGGAAACTAAAGGCTTTTGAAAGGCTACCTCTATT - 6300
6301 - TTCTTAAAACCCACCCTCCCACCAAAATAAAAGTTCTCATCTTAAAAGTAGGCTGGCAGG - 6360
6361 - GAGAAAAGGCCTTGGAGTCACATTCCTACCTGAGAACTTCAGGGCAACTTCTGATGAGTT - 6420
6421 - CCCACCTCAACTCCAAAATTAAAGCCCTCAACAGAAGTAGCTAGGAAGCTGATCACTTCT - 6480
6481 - AATTACAGCTCCCTCCCCTCCTAGCTACTTTCTGTGCTATCGAGATACTTGGGAAGACAC - 6540
6541 - AGGCCAGAGTCCAGCCAATTCGTGCCCACAGATCCAGAAGCTGTGGTCCATCGGCCGATG - 6600
6601 - GGTGCCCCTAGGACCAGCCGAGGATGACCTTTATTCATGGTAGGAGCTAGGGCAATAGCA - 6660
6661 - ACGTGGGCCTGGGAGCTGGAGGGGGAGGCAGAACCCCACCAAAGACAATCCACCTTCCCA - 6720
6721 - AACACTTTGCTTCCCTTAGTAGTGATAGCATTTTATTGTGCCCTGAAAAGCACTTCATGC - 6780
6781 - AGACCCCAGTAACAACCCATGGAGATCTATGCTATTGGCCCCATTTAACAAAGAAAACAG - 6840
6841 - GGTGCTCAGAGAAGTTGTTACCTGCCCAAGGACACACAGCTAGCAGAGCGAATGGACAGG - 6900
6901 - TCAGGACCAGTTATTCAGCCTCTAGGAGCCATTACTAAGTCTCTGATCAACAAGGAAACA - 6960
6961 - AGTTTCCCCCGGGGGTTTTTCCCACCCGCAGCTGAAACAAAGCCTCTTTCACCTGAGCCT - 7020
7021 - CTCACTCAAAGGGAGGGACTCCCGAGGGGCAGGGGGCACTCAAGTCCAGGCCTGTCTATC - 7080
7081 - CCTGGCCCCCCCACCCCAGGATTTTGTGCCCGCACCGCTTGGGGACTACCAGCAGCTGCT - 7140
7141 - GACCATCGGCTTCGAGGAGCCCACGCCCACGCTGGCCACCGACCTGCTGGTGCAGATCCT - 7200
7201 - CACGGGCCAGGCAGGCCAGGCCCGGCCTCCGAGCGCAGCCGGGCCTGCGGGGTGGGCAGC - 7260
7261 - GCAGGGGTCTTGAACCTGGGGAAGAGGGTAGGAGCTGGAACTTGACAGTTCCAAACTCCA - 7320 
7321 - GAATAGGGGGCAGGGGAGGGGCTCACTCGTTCTCGCAGTGCAGCCGGGCCTCGCCTTCCA - 7380 
7381 - AAGGGCCAGGCCGAGCTGACCTGTCTGCACCGAGTCCGGCTTGGCCGTGGGGCCCTGAAT - 7440
7441 - GCGGACACGTCAGTTTTGTGTTAAATAAAAGAAAGAAAGAGGTCACAGGCTCAGCGTCCG - 7500
7501 - CTGCGAATGCCGCGCCCCTCCCCCGGGGGATTGCCCCACCCACTCGCGTGGCCTTCTGGG - 7560
7561 - AAATGTAGTCTTTTGAAAGAAGCCTGGAATTCGCCAATAGGCGGACGAGAGTTTGGCGCA - 7620
7621 - TGCGCATAGGCGCACATGAAGCAAAAAGGGAAGTGGTGCCCGTCAACACCGGAACCCAGA - 7680
7681 - AAACTGCAAGTTTAGGGTACCGGGGAAATTCAACGTCCACTGGAGGAAGAGACTTAAGGC - 7740
7741 - TACGCCCACTCCCATATTTTGACCCGGAAGTTATTTATTTTAGCGTAGAAGACTACTTTT - 7800
7801 - CCCGACGCGCCCCAGGAAAGTGCCCTCGATCAGTTTCCTAAGGGCCCGAGTTAGACTTTT - 7860
7861 - TTTTTCTCTTCCAGCTTTTGGGACTTGGGGGCCGGACAGGTCGTCGTCTTTCTTGGGGTA - 7920
7921 - TCCGGGGTGCGGACAAGGTGGGAGAGCCCTACGGTATCCAAGCTT - 7965



FIG. 23F 
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1 - CAACATGCTTGGGACCAGAAGTGTTTCCAATTTGGGATTTTCTCAAATTTTACCGGTTGA - 60 
61 - GCTTCCCCAATCTGAAAATCTGAAATCCAACATGCACGGCTCTGAAGTCTTTCACTGAGC - 120 
121 - CTTTGGGGGAAATATTTAACATCCTAACAGCCCTAAACCAACGCTCAATTAGCACAACAG - 180 
181 - TTTACAATCTTCTCTACCCACAGCCTGATGCGAGGCTCTGGGACTAGACTATTTAGCCAA - 240
241 - CAGTTCTTGCAAAATTAACTGACTTATAAGTAAATAGTAATTTCAACACCTCACTGCTAA - 300
301 - TGCTGTAACAACTCTGCAGACCTAGGGAGCAAGTACGGTTTGCAGAGCACTGGGAAGGCT - 360
361 - CTGAAGTGACCTTTGAACTGGGCCTCAAAAAATTTTGGGTTTGGCAAAAGTCAAATCTCT - 420
421 - TAGGCTTCAAATTCCAGGCACAAGGATTGTTGGGTTTGATTTCATTATCCAGAAGCAATG - 480
481 - GGGATACAGAATTGTGATCTCATGTGTAGGGAACTGTGGGGGTTTTTTCTACTTTAACCC - 540
541 - CAGTGAGACTTTGTAGAGTGTGGGGTAGAGAAAAGGCTCATGAATATGCCTGAAGCCTAA - 600
601 - CTCAGCACCTTTCTGAGGAACTGACTGCCAAAATGGTAATGGAGAGGGGAAAATATGACC - 660
661 - TACTTTCACAAGTTACCTTGACTGCCTCAGGGAAACCTGCTGTGGTAGTGTTTCTTCTGG - 720
721 - GTGAAAGACCAGGTAATTACCTGGGTGCTGGTCTCAGACTTACCAGTTTTGAATCCCTGT - 780
781 - TTTAACCACTCACTATCGATATGACCTTGGATAAGTTACCTAACCTTTCTCTTACTGTCC - 840
841 - TTTTCCGTAAAATGGGGATAACAGATAGTAGTTATTTCTATGAGTGGTTATGAGAACCAA - 900
901 - GCTATTAGATAGCGGGAAAGCACACAGTAAGCGTTCAAGGAACTGCTATTGTTATTAAAA - 960
961 - GCCTCCTTTGGAAGAAGGACATTGAGGCCCAGAGAGAGAACAGAACGTCCAGCCACACAG - 1020 
1021 - CAAATCCGTGATGAAGTTGGGACTGGAGTATGGGTCTCCTGAGTCTCAGCCCAGGACTCT - 1080 
1081 - ATCCCTCTTCCCGAGTCCTCGGAGTTCCCGGATGGAGTCACATTTGTTCACGGCGAGGGA - 1140 
1141 - GGAAGGTTTGATGGAGGCCTGCAGGAAACAACAGCCAGGCGCAAGGCTTTGGGAGTTGAA - 1200
1201 - GCATAGCTTCTGCGAGATAGAAACAAGGTTGACATGGGCACTCGTGCAGAATGACGGGCT - 1260 
1261 - CCTTTTGGACTCCCAGGACTACAGTCCCTTATGCACCTTGGGATCTGCGGCTAGCCCCTG - 1320 
1321 - CGTAAAGAGGGACGCGTAGTCTTTTCCCTGCCCCGCCCTGCCGGGGCGCCCGCCTCCGAG - 1380 
1381 - GCCGCCCTCGCTTCGTCCTTCCCAGCAAGCTCCGCGCCGGCGCCGGCTATTGATTGGCTG - 1440
1441 - AGGCGGGAGCAGGCGGCTGGCCGGCAGCAGTTACTCGGGGTTTCCGGTGCGAGGCCAGAG - 1500 
1501 - GTGGGGAAGCCATCGGACGTCGGCGGTGAGGTACGTGCAGCGGCGGCCGGTGGGCGAGAC - 1560 
1561 - TATTTGAGAGTGTGCGGGCCGGGATGTTCTCGGCCTGTGGGGAAATCACGCCAACTCCCC - 1620 
1621 - GCGTGGGCCGGGGGCTGTCTGGGGATATGCGCATGCGCGGGCGTGCCTCGCGGCTTGAGG - 1680 
1681 - GCGCGCGGGGCGTGGGTGGCTGCGCGCGCGGGGGGCGCACGTGGGGCCTGAGGGGCGGGG - 1740
1741 - GCGGTGCCGGGAGTCCCGCCACGTCAGTCTCCGGCCCTGAGCCAATCCCGCGCCCGGCCT - 1800 
1801 - GCCGCGAGGGGGCCGGTTGTGCCGGGAAGTGGCTCCAGGGAGAAGAGGCCTCTTCCCTCA - 1860 
1861 - CCCGCTGTGGGAGCTGCGCCCCGAAAGCCTGCCCCGGCACGTCGGGCTCTCCTGACCCGC - 1920 
1921 - CAAGACCAGAGAGCCGTTGGCGCCCTCCGCCCGGGCCTGCCGGTCCGTTTATTTTAAGAA - 1980 
1981 - GCTTTGTGCGCCTGCTGTGGGGATTTCTGATCCAGGCTGCGAAGAATTTCGAAGTCTGGA - 2040



2041 - AAATAGCAACTGTGTTTGTTTCTAAAGGATCTTCTCCTGACCCAGCATCGCTCATCACAA - 2100 



2101 - TGAAGAACCAAGACAAAAAGAACGGGGCTGCCAAACAATCCAATCCAAAAAGCAGCCCAG - 2160 
- KNQDKKNGAAKQSNPKSSPG 

2161 - GACAACCGGAAGCAGGACCCGAGGGAGCCCAGGAGCGGCCCAGCCAGGCGGCTCCTGCAG - 2220 
QPEAGPEGAQERPSQAAPAV 

2221 - TAGAAGCAGAAGGTCCCGGCAGCAGCCAGGCTCCTCGGAAGCCGGAGGGTGTGTGCCAGC - 2280 
EAEGPG SSQAPRKPEG 



2281 - TCTGCGTTGCCAGCGGGCAGGGGGAGGAGCTGTGGGGTCGGCCTCGCTTCTGGACTTACA - 2340 

2341 - GGCCGAGGCCAGGTTGTCCGGGAGGAGGAGATGTAGAATGAGAGGACAGTGCTGGGGGCC - 2400

2401 - GCGGTCCCCCCTGCGCTCTGGCGAGTTGGCGGAGCTGCCCCCTCTAAGCACAGGAACAGA - 2460

2461 - GTTCTGGAGAGAAGCTCCGACGGGATTAAGTCAGGTGGCAGCCAAACGAGGCACCCAGTC - 2520

FIG. 24A 
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2521 - AGGAAATCCAGGTCCCGTTAGAAACACCTCAGCCACCAGCAGCTAACTGCCCTTCCTGTT - 2580 
2581 - TGAGGCATTTCTAGAATGATCTGAATGGCAAGAAATGGGTTTTGTGGGGGGGAAGGAGAT - 2640
2641 - GGACTAGAAGTTGCTCCGTGCCATCCCTGTGTGCTGATGCTTTACATACTTTTATGATCT - 2700
2701 - AACAAATATGTTCGGGTGGTAGTGAGAAATAGTTGTGTCATTTTACAAGTAAACAGACTT - 2760
2761 - AAAGAAGTTAGGCAACGATTACTATAATTTCTTGATTTAAAAGATGTTTCGAATCTAAAT - 2820
2821 - TCTGACAGGAACTAGATTTGCTGAATGATACTCCATTCTTGCTTCTCAGTTTCCATAAAA - 2880
2881 - AAAAAAGTTAGGCAACATTTAACTCAAACTGATGAGTTTGGCTGGGCCTGAAAAATCCCA - 2940



2941 - ACCAGTGGTATAATCGTCTTCTTTCTCACTCTACCCCTCATCCTCTCCTGCTGTAGGGGC - 3000

A 

3001 - TCAAGCCAGAACGGCTCAGTCTGGGGCCCTTCGTGATGTCTCTGAGGAGCTGAGCCGCCA - 3060 
-QARTAQSGALRDVSEELSRQ 

3061 - ACTGGAAGACATACTGAGCACATACTGTGTGGACAATAACCAGGGGGGCCCCGGCGAGGA - 3120 

-LEDILSTYCVDNNQGGPG/ED

3121 - TGGGGCACAGGGTGAGCCGGCTGAACCCGAAGATGCAGAGAAGTCCCGGACCTATGTGGC - 3180
-GAQGEPAEPEDAEKSRTYVA

3181 - AAGGAATGGGGAGCCTGAACCAACTCCAGTAGTCAATGGAGAGAAGGAACCCTCCAAGGG - 3240
-RNGEPEPTPVVNGEKEPSKG 

3241 - GGATCCAAACACAGAAGAGATCCGGCAGAGTGACGAGGTCGGAGACCGAGACCATCGAAG - 3300 
-DPNTEEIRQSDEVGDRDHRR 

3301 - GCCACAGGAGAAGAAAAAAGCCAAGGGTTTGGGTGAGCAGAGGGCGGCTCTTTGTGAAGC - 3360 
PQEKKKAKGLG 



3361 - TGGTGAGGAGAGGGAGTTTGGACTTGACGTTCTCTGGGCCAGTCTGTTCTGCCAGGATTC - 3420

3421 - AAAGGAAAACGGTACTTCTCAGAGCAGCAAGTCACTCTAGTCTAATCAAAGCCAGGGATG - 3480

3481 - TGGGGGCCACGGCATAGAGAGATGCAGGAGTTACCAGCACAAAGCCTTCTGGGTTTTGGA - 3540

3541 - GCAACTGGAGCTTGGCATGGGACCTGTTCTCTCTTTGAGAAAATGGAGACGGGAGGCTAG - 3600 

3601 - GGTAGGCTCCTGTGCCAGCCAGTACTACCTGCTGTGTGACCTTGGGTGTGTCCCTTCTCC - 3660 

3661 - TCTCTGGGTCTTAGTTTATATTTCTCTTTACAGTAAGAAAATTAGACTAGGCCAGAGTTG - 3720 

3721 - AAAACCCAAATATCTGCATAAGCTGGGCTTGGCCATGGGGCCACCTGAAGATGGAGGCTT - 3780 

3781 - TACTGCTTCCCTGATTAGTTGCTCTCACTAGCCAACTGAGAGCAGGCAAAACTACAGGCT - 3840

3841 - GGGTGCAGTCAGGCTTTTTTTTTTTTTTTTTTTTTTTAAATAAAGAAAAGCCAGAAATCT - 3900 

3901 - AGAGTTATGTGAGAACTCTAGATTTTTTCATAGTTAGCAGCTAAAATGGTAAGAGCCAAA - 3960 

3961 - CAAAACCCATCCGTGGGTTGGATTTGGCACACATGCCTGCGAATTGCAGTCTCCATGCTG - 4020 

4021 - ATCTCTTGGGCCCTTCTGGGGAGGCAGAGGGAAGGCTCCCTGACTCAGTCACAGGCAATG - 4080

4081 - GGGAATAGGCAGTGACAGTCATTTTACAGCAGGGTATGTATGTTTAAGAGTCTAGGCCGG - 4140

4141 - GGTGTGGTGGCTCACGCCTGTAATTGCAGCACTTTGGGAGGCCGAGGCGGGTGGATCACC - 4200

4201 - TGAGGGTCAGGAGTTCGAGAACAGCCTGGCCAACATGATGAAATCCCGTCTCTACTAAAA - 4260

4261 - ATACAAAAATTAGCTGGACATGCTGGCACACGCCTGTAATCCCAGCTACTTGGGAGGCTG - 4320

4321 - AGGCAGGAGAATGGCTTGAACCCGGGAGGCAGAGGTTGCAGTGAACTGAGATTGTGCCAC - 4380

4381 - TACATCCAGCCTGGGTGACAAGAGTGAAACTCTGTCTCAAAAAAAAAAAAAAAGAATCTA - 4440

4441 - GAATCTAAGTCGAGTGTCATTATATCCATGTTTTATTCCTATTCCCTTTTCCCCTTATGT - 4500

4501 - ATCCTCTTACTTTAAAGAGGAACTTTAAAAAATCTTAGGGACGACTAGGCAGAGTGGCTC - 4560

4561 - ACACCTGTAACTCCAGCACTTTGGGAGGCCAAGGCAGGCAGATTATGAGGTCAGGAGTTC - 4620

4621 - GAGACCAGCCTGGCCAACATGGTGAAACCCCAGTTCTACTAAAGATACAAAAAATCAGCC - 4680

4681 - GGGCGTGGTGGCACGTGCCTATAATCCCAGATACTCGGGAGGCTGAGGCAGGAGAATCAC - 4740

4741 - TTGAACCCGTGAGGCAAAGTTTTCAGTGAGCTGAGATCATGCCATTGCACTCCACCTGGG - 4800 

FIG. 24B
4801 - TGACAGGGTGAGACTCCATCTCAAAAAAAGAAAAAGGAAAAAATCTTAACGTCACATACA - 4860

4861 - TGGAAAGATCATCTTTTTCACCCCCCACCCCCAACTGAGATGGAGTTTTGCTCTTGTCAC - 4920

4921 - CCAAGCTGGAGTGCACTGGCGCGATCTAGCTCCCTGCAAGCTCCGCCTCCCGGGTTCACA - 4980

4981 - CCATTCTCCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCTCCTGCTACCATGCCC - 5040

5041 - GGCTAATTTTTTTGTATTTTTTTTAGTAGAGACGGGGTTTCATCTGTGTTAGCCAGGATG - 5100 

5101 - GTTTTGATCTCCTGACCTCGTGATCCGCCCGCCTCAGCCTCCCAAAGTGCTGGGATTACA - 5160 

5161 - GGCGTAAGCCACTGCACCCCGCCTTTTTTTTTTAATTAATTAATTTTTTTAGACAGAGTC - 5220 

5221 - TCGCTCTGTCCCAAGCTGGAGTGCAGTGGCGCGATCTGGGCTCACTGCAACCTCCGCCTC - 5280 

5281 - CTGGGTTCACGGCGATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGACTACAGGCTCCTG - 5340

5341 - CTACCATGCCCGGCTAATTTTTTTGTATTTTTTTTAGTAGAGACGGGGTTTCACTGTGTT - 5400

5401 - AGCCAGGATGGTTTTGATCTCCTGACCTCGTGATCCGCCCGCCTCAGCCTCCCAAAGTCC - 5460

5461 - GCCTCAGCCTCCCAAAGTGCTGGGATTACAGGCGTAAGCCACTGTACCCTGCCTTTTTTT - 5520

5521 - TTTAATTAATTAATTTTTTTAGACAGAGTCTCGCTCTGTCACCAAGCTGGAGTGCAGTGG - 5580 

5581 - CGCGATTTGGGCTCACTGCAACCTCCGCTTCTTGGGTTCAAGCGATTTTCCTACCTCAGC - 5640

5641 - CTCCGGAGTAACTGGGACTACAGGCGCGTGCCACCACACCAAGCTAATTTTTTTGTGTAT - 5700 

5701 - GTCTTTAGTAGAGATGGGGTTTCACCATGTTAGGATGGTCTCGATCTCTTGACCTCGTGA - 5760

5761 - TCCGCCTGCCTCGGCCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCTTGCCTGGCC - 5820 

5821 - GAAAGTATCTTCATTTTAAAGTTCACTGTTTGGCTACTCTGTTGACAAGAGTTTAGTATT - 5880 

5881 - TCTCAAGGAGGCTAAGATACCTATTCCTTTTTGGATCCTACCTCTATCAGGAGGGTGGGC - 5940

5941 - CTTCCTTGCATTGAAACAGTATGAAAACAGTAGCCCTGAATTCATAAGTGGGACACCTTT - 6000 

6001 - CTTCTATTGGTAGAGCAGGCAGTTTTTTTCTCCTGCCAATGGTGCCTACTAAGGAGATTT - 6060 

6061 - CACTAGGGTACAGTCGTTCATTTGATAAGCATTTGTTGAGCATATCCTCTGTGATGGTAC - 6120 

6121 - TATGGACAGTACTGGGGCTATAGTGAGGGCAGGATTGAGTTGGTCCTTATGGCAAGGAAG - 6180 

6181 - GCAGCTAATCAACAAGCAAAATATAAAGTATGATGGGGAGGGCTGTCTTCAGCACTCATG - 6240

6241 - AGTGTGAGCCCAGGCCTGGAGGGGACACCTGGAGAAGAGGGTGCATGTCTTTGCTCCTGT - 6300 



6301 - GCTTTTCAGGGAAGGAGATCACGTTGCTGATGCAGACATTGAATACTCTGAGTACCCCAG - 6360 
KEITLLMQTLNTLSTPE 

6361 - AGGAGAAGCTGGCTGCTCTGTGCAAGAAGTATGCTGAACTGGTCAGTTCCCCCCTCCGCG - 6420 
EKLAALCKKYAEL 



6421 - GGCACCTTGCCTGCGTTGGGAAAATCAGCATGCCACCTGGTGTAAGGTTGGGGGTGCAGA - 6480

6481 - GTCAAGTAGGTGGCTTAATTCCTGTTCAGCTTTTCTCTGAACTATCTGTTAAATGGGGAA - 6540

6541 - TCACTTCCAGCCAGCCTCTTCAGGGCTGTGCAGCAAGAGGAGAAACTGCATATTCCTTGA - 6600 

6601 - AAGAAATTTCTCAAAGAATGATTCCAAGGTGGTAGAGCCCTTGTTCCTGGCCTGAGTCCA - 6660 

6661 - AGACACCTTGTGATCTTGATGCTTCTTCCTCAAATACAGATGCATAGAGCCATTATCACA - 6720 

6721 - GTTAATAAAACTAACACTAGTCACTTGATACTTTTTCCTTTTACTCCAGAGCAGTCTTCT - 6780 

6781 - TGTCACTGCCTCCTCATATTCCCCATGACATTGACTTTTAACAGAAACTAGACTAGCTGT - 6840

6841 - CTTGTAGGATGCCCCCTTCTAGCTTTGTCATCTCTGTGGTATCATTTTACTTCTTTACCT - 6900 

6901 - CCTGGTACATGTAAGTGAAGTAGAAGTTAGCTCTAAAGCTTGATCCAATTCAGCTTCAAC - 6960 

6961 - TTTTTGACAAGAATTCTTCATAAGTACTTCATGTTCCATCACAATAAATGCAAAGCATGC - 7020 

7021 - TCTTCCCACTTTGTTGTAACATTGTTCAGTGGGTTGGGGGTGGGGCAGCCAGATTCTTCC - 7080 

7081 - ATCATCAGGTCCCTTGTCAGAATTTGAACTAACAGATTTATCCATTGATGGTCACAGCCT - 7140

7141 - GTGTATGTATGTATGTATGTATGTATGTATGTATTTATTTATTTATTTATTTTTTGAGAC - 7200 

7201 - GGGGTCTTGCTCTGTCGCCCAGGCTGGGGTGCAGTGGCACGATCTCGGCTCGCTGCAAGC - 7260 

7261 - TCCGCCTTCTGGGTTCATGCCATTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGTCTACAG - 7320

7321 - GCGCCCGCCACCATGCTAGGCTATTTTTTTTTTTTTTTTTTTTTTTAGTAGAGACGGGGT - 7380 

7381 - TTCACCGTGTTAGCCAGGATGGTCTCGATCTCTTGACCTCGTGATCCGCCCGCCTCGGCC - 7440

7441 - TCCCAAAGTGCTGGGATTACAGGCTTGAGCCACCACGCCTGGCCTATTTATTTATTTATT - 7500 

7501 - CAGAGTCAGAGTCTCGCTCTGTCACCAGGCTGGAGTGCAGTGGCGCGATCTCGGCTCATT - 7560 

7561 - GCAACCTCCACCTCCCAGGTTCAAGCGAGTCTCCTGCCTCAGCCTCCCGAGTAGCTGGGA - 7620

7621 - TTACAGGTGCATGTCACCATGCCTGGCTAAATTTTGTATGTTTTAGTAGAGACAGAGTTT - 7680
7681 - CAGTATGTTGGCCAGGATGGTCTTGATCTCTTGGCCTCGTGATCCGCCCGTCTCAGCCTC - 7740
7741 - CCAAAGTGCTGGGATTACAGGTGTGAGCCACTGTGCCTGGCCTCTAAGTATTTATTTTAA - 7800
7801 - AATTAATTCATTCCACACACATTTATTAATATTTTCCTGTAAGGAACTTTACTCATCTTT - 7860
7861 - AAAATGGGGAATGTCATACCTGCCTAATGACATTCTTGTAAGGATTAAATAAAAGGTATA - 7920
7921 - AGGAAGATAAGCACCCTTTTGGAGTGATCCAGCCAGGGGAAAATTGCTGATGCAAGAGAG - 7980
7981 - GAAATGAGTTGCTAGAGTGGTGTTGTGAGTAGAGGAGGGGAGCTGAGGCCTGCCCAAGAA - 8040
8041 - GGGGGCTTGGCTGTGGTAACCACATGGCTAGGTCTGTGTGACTGGAGGAGAGGACGGGGC - 8100
8101 - AGGTGGACTGGTAGATGTGCAGCTTGTGCCCCTGATTCTCTAGTTTCTTCTGTGTTTTGA - 8160
8161 - GATTTGATGAGAACGATGAAATAGTTGTCTGGAAGGAGAGGAGTGTGAATAGCATATGCA - 8220
8221 - TTGTATTGGGATTGCTGGTCTTCCTGAAATTGGTGGCCATGAATTTAAAGTGAGACTCTT - 8280
8281 - CAAGTAGGGTTGTTATAGTACTGGTGTAAAGCAGGAAGGTGCTTTACTAGGGTTGCAGTA - 8340
8341 - CTACTGGGGAAGGGCCAAGAGAGTTGAGGGTGTAAGAAATCCAAGCCAGGTAATGTAGTT - 8400
8401 - ATTTTAAAGGAGAGTGGAAGGATGGTTGAGTCAATGGATTGGAGGTCCTATAGGGTAAGA - 8460
8461 - GACTTTCTGAGGATCACAGATACTGATTGGAATGAGCTAAAAAGATAGGTGATGGTAGTC - 8520
8521 - CTGGACTGGGATGCTGGAAATTGAGATAGTGGGTGTGCTCTCTGGTAGTGACAAATCTAG - 8580
8581 - ATCTGCGCTGTCCAAGATAAATTCGTCTCTAGCTAATTGACATGTGGCCAGTTTGAATTT - 8640

8641 - GAACATGCTATAAATGTAAGATACACATCAGCTTTTGAAGACTTAAGCAAAAACAAAGAA - 8700
8701 - TATAAAACATCTTTTTGTGAGAGAGTGTCTCAGTCACCCAGGCTGGAGTGCAGTGGCGTG - 8760
8761 - ATGTCCTGCTTCCAGGTTCAAACGATTCTCCTGCCTCACAGCCTCCTGGAGTAACTGAGA - 8820
8821 - TTACAGGCGCATGCCACCAAACTGGCTACTTTTTTGTATTTTTTTTTTAGTAGAAACGGT - 8880
8881 - TTCACCATGTTGGCCAGGCTGGTCTTGAACTCCTGACCTCAAGTGATCTGCCTGCCTCAG - 8940
8941 - CCTCCCAAAGTGCTGGGATTACAGGCATGAGCCACCACTCCCGGCCTCACTTTTTTACAT - 9000
9001 - TGATTCCGTGTTGAAATTGTAATGTTTTGGATATTAGGTTAAATACATATATTACTAAAA - 9060 
9061 - TTAATTTCACCTGTTTTTTACTTTTTTAGTGCGGCCAGTAGAATATTTTTAATTACTTAT - 9120 
9121 - GTGGTTTGCATTATATTTCTGTTGTACAGGCCTGGATAGGGTCATGGGAGGGGAACTGAG - 9180 
9181 - CTGGGGAAAGGAGTGGGTTTGTGGAAGAGGTGATGGACTGTGAGGCCAGGGAGTTAGAAG - 9240
9241 - GATTATCTGTTGATACTGAAGTGGCCACAAATGAGAAAAGTAATTGTGTTGGGGAGAGCG - 9300 
9301 - CTGATGAACGCAGCGCTAACGTTTTGAAGGAATGCGAGGGAGCGATGGGGGTCTGTCTGT - 9360 
9361 - TAATAGGCACAAGGTACGGTAGCAGGTGGTCTCATCCTCGGGCATGAGTGTCCAGCAAGT - 9420
9421 - TGGGGAAATGCAACAGCTTGAAGTGGCTCTAGTGGCCCAGAGTCAGAGCTGGAATAGGAA - 9480
9481 - TTGGCATCTGCTGGCTGTGTGGCCCCTGCTTGCCCTAGTGAGTTACCATTTCTCTGTCCC - 9540
9541 - TACGGTGGAGCCTTTGGGGTTATTGTGAGTTCATGGGAGGAGCGTGTAAGCACCGGCACA - 9600 
9601 - GCATCAGCCCATGAGAGTGCTCCTGGCCTGAGAGGGTAAGGGTCAGGGCAGCTCAGGAGA - 9660 
9661 - CCCTAGACCTGCATAGTGATCCCCCCACCAGGAAGGCCCCACAAGATGCTCACCTGCCCT - 9720 



9721 - CCCTATCCCTGTCCCCAGCTGGAGGAGCACCGGAATTCACAGAAGCAGATGAAGCTCCTA - 9780 

LEEHRNSQKQMKLL 

9781 - CAGAAAAAGCAGAGCCAGCTGGTGCAAGAGAAGGACCACCTGCGCGGTGAGCACAGCAAG - 9840
-QKKQSQLVQEKDHLRGEHSK

9841 - GCCGTCCTGGCCCGCAGCAAGCTTGAGAGCCTATGCCGTGAGCTGCAGCGGCACAACCGC - 9900 
-AVLARSKLESLCRELQRHNR 

9901 - TCCCTCAAGGTAGGCCTGGGCCCCCTGGAACAGGTGACTCTGGTTTCCTTGACTTCCACT - 9960 
-SLK

9961 - TAATGTTTCTTTCATGGGCTTTCCTCTTAAAAAGTAGTGCAGGCTAGGGCCAGGCGCAGT - 10020 

10021 - GGCACACATAAGTGATTAAAAATCTTCTGGCCACTAAAAAACAGAAATTAATTTTAGTAA - 10080 

10081 - TATACTTAACCCAATATCCAAAACATTACAATTTCAACATGAAATCAGTGTAAAAAAGCA - 10140

10141 - AGGCTGGGTGTGGTGGCTCACACCTGTAATCCCAACACTTTGGGAGGCTGAGGTGGATGG - 10200 

10201 - ATCACTTGAGGCCAGGAGTTTGAGACCAACCTGGTCAACGCAGTGAAACCCCATTCTACT - 10260 

10261 - AAAAATACAAAAATTAGCCGAGTGTGCTGGCAAATGCCTATAATCCCAGCTACTCAGGTG - 10320 

FIG. 24D
10321 - GCTCAGGCATGAGAATTGCTTGCACCTGGGAGGCTGAGGTTGCAGTGAGCCGAGATTGCA - 10380

10381 - TCACTGCATTACAGCCTGGGCAACAGAGTGAGACTCAGTGTCCAAAAAAAAAAAAAAGTA - 10440 

10441 - GTGCAGGCTTGTGGCATAGAAATACACTTTCTCAATAATGCCTTACGTTAAGAGAGTACT - 10500

10501 - GCTTGTAATCATTTGACATGTATTAGATAAGGTGAAGGATAAAGTACTAAGAGAATCCAT - 10560 

10561 - AATGCACTGGCGTTAGTATTTCTCAATGAAATGACAGTCCCCTGGTAAGCGGAGGCCTGG - 10620 

10621 - CTCTGACAAGCAGCTCTTGTCCCAGACGTTGGTCAGTCAGGAACCTGGGTCCTTCCCATG - 10680 

10681 - TTCTGCTGCTTCTATGGTGAGGTCAGTCTGTGGTTACACCAAGTTTAAATACAGCCTTTT - 10740 

10741 - AACTTTCTTTTTTATATGTAAAATCTTACATGTAGTTTTTAGAATGAAATTATTATACAT - 10800 

10801 - GTACCATTTCATATCCTGTGCCTTTTTTTCACTTTACATAACATTTTTCCCTATCAGTAT - 10860

10861 - GTGTAGGGCTATCTTCTCATTATATGGATATATTATATCAGTGCCCTAGTTAAAGCATTT - 10920 

10921 - TGGGGGTTGTTTACAATTTTTCATTATTACATATAGAACTATAGTGAAAATTCTTGTTAT - 10980 

10981 - ATTTATCACTGGTCAGTTATATAGAACTTATCTGTAGGATAAGTCATGGAATTGAAATGG - 11040

11041 - CTAGGTCACAGTATATGCAGATTTTTCATTTTAATAGATTTTGCTGGATTGCCTTCCAGT - 11100 

11101 - GAGGGGGCAGTGTGCCTTCCCCATCAAAAGTGTTGAGTGCCTAATTCTGCACAACTTTGC - 11160 

11161 - AAACCCTGGGTGTTACTAAATTTTAACAGCTTGGTCTCTGGGGGTACAGAGGGGACAAAT - 11220 

11221 - GCACATTAATCTGAAATCTGGAAGAATAGGCCTTAGGAGATCCGACTTGCTTCAGAATGG - 11280 

11281 - CACTTAGCACTTACATGTGTGCATGTGTGCCTGCATTTTTTCTTCCTTTTTTTTTTTTTG - 11340 

11341 - GGGACGGAGTCTTGCTCTGTGGCCCATCGCCCAGGCTGGAGTGCAGTGGCGCGATCATAG - 11400 

11401 - CTCACCACAACCTCCGCCTCCCAGGTTCAAATGACTCCTCTGCCTCAGCCTCCCAAGCAG - 11460 

11461 - CTGGGACCACAGGTGCACACCATCACGCCGGCTAATTTTTGTATTTTAGTAGAAACGGGG - 11520

11521 - TTTCACCATATTGGCCAGGCTGGTCTCAAACTCCTGACCTCGTGATCCGCCCACCTCAGC - 11580 

11581 - CTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCTGCCATGTGCCTGCATTTTT - 11640 

11641 - CTAGGGGGAGAATCTCACTTGATGTCACCTGATATACAGAGGGGCCCATTGGAACCCGCA - 11700 

11701 - TTGCACAACATCCTGGAGTCTGGCTACTCCACGCTTTGGGAGCAGGGAGGGCTGTTGGCA - 11760 

11761 - GAGACCATCTGTGGACTAGCTGGGGGACCCTTGTGAGGTAGCAGTGGATGATGGCTCTCG - 11820 

11821 - GGCTGACTTCTTTGCCCAGGAAGAAGGTGTGCAGCGGGCCCGGGAGGAGGAGGAGAAGCG - 11880 

EEGVQRAREEEEKR 

11881 - CAAGGAGGTGACCTCGCACTTCCAGGTGACACTGAATGACATTCAGCTGCAGATGGAACA - 11940 
-KEVTSHFQVTLNDIQLQMEQ 



11941 - GCACAATGAGCGCAACTCCAAGCTGCGCCAAGAGAACATGGAGCTGGCTGAGAGGCTCAA - 12000
-HNERNSKLRQENMELAERLK



12001 - GAAGCTGATTGAGCAGTATGAGCTGCGCGAGGAGGTAAGGGTATCACGGACAGCAGTCAT - 12060 
-KLIEQYELREE 

12061 - GGCCCAGAAATTGTGAGGTTTTGAGTGTGTGCTAGGCACTGGGACAGTACCTTTTCAGGC - 12120 

12121 - TTCATCCCATTCTCCCTTTCTTCCTCCTCCTCCTCCTTGGGAGGAGAGTAATGTTATTCC - 12180 

12181 - TCATAGATAAAAAACAGGTGTGGAGAAGAGACTCACTTACAGCCACACAGCCCCAGGTCC - 12240

12241 - ACAGTGCCTTGTCCCAAATGACTGGGCCAGGCATCTTTTGGAATTAGAACTATCCACATT - 12300 

12301 - TTAGAATGGAGGTACATGTATGGACTGTGTGTTATATAGCACCCTCAGCAGGGCCTTGGG - 12360 

12361 - GAAGCCAGACACATTAATGTATTTATGCAGTAGAACTTCCAAATACTCACCTACATTATG - 12420 

12421 - GGCTTACAATGATGCAGGTCAAGTCTGGCTGCCAGCTTATGACAATTTCCATTTTCAGAA - 12480

12481 - CTTTGTAGAATTTGGAATTGCAGGGGAGGGGTGTACCTGTGATCAGTGATGGACTCCAGA - 12540

12541 - GACTGTGTCCACTGATTCCTTGCTGCTCCTGCCACTCAAAAGGCAGAATTTATCAGGCTG - 12600

12601 - GGCGTGGTGGCTCATGCCTGTAATCCCAACACTTTGGGAGGCCAAAGCGGGCGGATCACC - 12660 

12661 - TGAGGTCAGGAGTTCAAGACCAGCCTGGCCAACATGGTGAAACCCTGTCTCTACTAAAAA - 12720 

12721 - TACAAAAAATTAGCCAGGTGTGGTGGTGCACGGCTGTAGTCCCAGCTACTCAGGAGGCTG - 12780 

12781 - AGGCAGGAGAATTGCTTGAACCCAGGAGGCAGAGGTTGCAATGAGCCAAGATTGTGCTAC - 12840

12841 - TGCACTCTAGCCTGGGTGATATACCGAGACTCCATCTCAAAAAAAAAAAAAAAAAAAAGC - 12900 

12901 - AGGATGTCACTCCCTTTGTCACTGCGTTGGCTGCCACCCCAGGCACTTGAATCTTTGGAT - 12960 



FIG. 24E
12961 - CTTCCCTGCCAGTCACCTGGCTGTTCTGGGCGCGTTCTCATCATGAGAAGGGAGACCTGC - 13020
13021 - AGCCCCCTTACAGGGCTGGCAGAGGACCTGCTCTGGATTAGGCCCTTTCCTAGCCCCTGG - 13080 
13081 - GGTGTGGCAGTGGGTGAGACCGGGAAGATCTGCCCTCTTAGGTTCATAGGCCAAAGTGAT - 13140 
13141 - GATCGTGTGTGCAGGACCTAGAGGGCbCTCCCCTGACCCACCCCTTTCCTTGCCATACTT - 13200 
13201 - CATCCTCTGGGAACAAAGCTGCTTGTTTGGTTTGAGGGGAGTTGGTTTGGTTCTTATCCC - 13260 
13261 - TCAGCGCTGAGACATAGAGGCTTCCTGGGCCACTACAGTGAGACACGAACTTCAAGAATC - 13320 
13321 - TGAATACCCCCGTTTTCTCTCCCCGCCAAGGCAAAAAAGGACTTAGTACTACCTGTGGAG - 13380 
13381 - AAGGAGGTGCAGGACTACCAGGCCCTGCTGCTTTGCATTTACAGCCCTCCCCAGACAGAC - 13440
13441 - ACAGGCACCCTCATCATACCCAAACTGGACTTACCTGCTAGGCACCTTCCCTTCCCCATC - 13500
13501 - CAAAAAAATGGAGTTATTTTCCCTTATTTCAGCAAGTCCAGTTGATTTTACCTTTGAAGT - 13560 
13561 - AGCACCTGAGTCCTTCACCTTCTCTCCATCCCTTCTCTCTCACCTGACACAGGTCTGCAG - 13620 
13621 - CGCTCCTCTAGTAGGCAGGACAGCCATTCCTTGGGGATGCACATGTCTAGTCTTTGCCTA - 13680 
13681 - GATATGGCAAGTCTTTGCCAACTGAGCTAGGCTGTTATGTTCTTAGAGGCATTGTTTTTG - 13740 
13741 - CCCATTCTTCCCATTTACAAGAGAATCAGGGACACAGAAGTGAGGGCTTCCAGCCCCATA - 13800 
13801 - GGTGATCAATCCTGGGGTCAGAGATTTGAGTGTGTTTATTGCTTGCCTTCTTGGGAGCAG - 13860 
13861 - ATTCCATCCATAAACCATGTGCTTACCAAGGTCTGACTCACTGGGAGAGAAACGACGTGA - 13920 
13921 - GGTTGGAAAGCTGACCTTCCAGAGACTTGGGGCCCATGTTGTGTGGTACACATGGGAGTC - 13980 
13981 - CATCATATCAGATTGAGATGGGGGGCTGGGCAAAGTGCCCTGGTCTGTGGCTGTGGGGCT - 14040 

14041 - ACCCTGAGAAAGGGAGCGCCTGACAAGCCGACTGCTCCCACCATCTTTGTTGCAGCATAT - 14100
_ HI 

14101 - CGACAAAGTCTTCAAACACAAGGACCTACAACAGCAGCTGGTGGATGCCAAGCTCCAGCA - 14160 
-DKVFKHKDLQQQLVDAKLQQ 

14161 - GGCCCAGGAGATGCTAAAGGAGGCAGAAGAGCGGCACCAGCGGGAGAAGGATTTTGTGAG - 14220
-AQEMLKEAEERHQREKDF 

14221 - GCTCAGGCCCCAGGGTTGGGGTGGGGGTGTGGGAGGAGACAGGCTGGGCTCTGGCTCAGC - 14280 
14281 - TCATAGCCGGGTTATATGGGAGAAGTCTGGCCAGACCAGGCACAGATTCCTTGAGTACCA - 14340
14341 - GTCTGAGAGCAGGAAGCCTCAGTGGGTCTGGTGCTTGTGGCTAAAAACCAAACATAGCCC - 14400

14401 - CTGGGGGCTTCTGACAGGATCTGGGGTTCTGTCTTGGAAATAGCTCCTGAAAGAGGCAGT - 14460

L L K E A V 

14461 - AGAGTCCCAGAGGATGTGTGAGCTGATGAAGCAGCAAGAGACCCACCTGAAGCAACAGGT - 14520
-ESQRMCELMKQQETHLKQQ 

14521 - GAGAGCATATAACCTGACCCTGTGCCTTCAAGTTTCCCTCACTGGGCCCCATCCTGGGGG - 14580

14581 - TAGTGAAATGGGACCCTCATTCTAGGACTGGCTGTGTCCTGGCTGCTATGACGCCTTGGT - 14640

14641 - TGAGCTTAGGTGGGCTCAGAGGACTTCATTTGTAGCTCAGAAATGTATTGCTTTTGAGGA - 14700

14701 - GGTAGGAACAGAAGAGTTTGAAAATCAACATAAAGGCAAAATAAAAGTCACCCTAAGTCT - 14760

14761 - CCTACTTTCCAGGCTTAGCATTTTGGATTATATCCTTCCAAATATATAGCTTTGCTTTGT - 14820

14821 - TTTAAGGAAAAATAGTATCTCAATAGAATTACTGGTCAGAGAGTCAAGGACGGGTCTGAG - 14880

14881 - TGTGTTGACCAGAGTGCCTCCCAGAGAAACCCAGTCTTATCTGTGGGCTGCTTTCTCCCC - 14940

14941 - ACAGCTTGCCCTATACACAGAGAAGTTTGAGGAGTTCCAGAACACACTTTCCAAAAGCAG - 15000
LALYTEKFEEFQNTLSKSS

15001 - CGAGGTATTCACCACATTCAAGCAGGAGATGGAAAAGGTAACTGTGGTCCAGGCCAGGCA - 15060
-EVFTTFKQEMEK 



15061 - TGGCTGCTGGGGCATAAGCTGCTTCATTCAAAATTGTTGGGCCTGCCTTCAGGAAGCTCC - 15120 

15121 - CATCTGGGGTGTCTCAAGGGCAGGGCTGTTAGGAAGGTTCACAGCCTTTCCCCTCTTGAG - 15180 

15181 - GCAGTATCAGTGGTATGTATACACTCCAGGTTGTCCCAGGGAATGGGGCAGTCTTTTCTG - 15240 

15241 - TTTGTTTGGTTTTTTTGGGGGGTTTGTTGTTGTTGTTGTTGTTGTTGTTGTTGTTTGAGA - 15300 

15301 - TGGAGACTCACCTATTGCCCAGGCTGGAGTGCAGTGGCATGATCTCAGCTCATTGCAGCC - 15360 

15361 - TTTGCCCCCCGGGTTCAAGTGATTCTCCTGCCTCAGCCTCCTGACTAGCTGGAATTACAG - 15420

15421 - GCGCGTGCCACCATGCCTGGCTAATTTTTTCTTTCTTTTTTTTTTGTATTTTTAGTAGAG - 15480 

15481 - ACGGGGTTTCACCATGTTGGCCAGGCTGGTCTCGAACTCTTGGCCTCAAGTGATCTGCCC - 15540

15541 - GCCTTGGCCTCCCAAAGTGCTGGGATTATAGGCGTGAGCCACCATGCCTGGCCCCTTACC - 15600 

15601 - ATTCCTTGTTATTGGTGGTGGACACCTCTGACTTCCTGGTGGTGAGGTGGCACAGAGGGC - 15660 

15661 - ATTGACTGCATCCTGTAATGCCTTGCGCCTTGGGATCAATCATTCCCCACCTTGGAGACA - 15720 

15721 - CAGGTGCAGTCCCCACCTTGGAGACACAGACCTTGGAGAGGCCAGCTCTGACCATTTCCT - 15780 

15781 - TCTGTCTGTCACATAACCTAGATGACTAAGAAGATCAAGAAGCTGGAGAAAGAAACCACC - 15840
_ MTKKIKKLEKETT 

15841 - ATGTACCGGTCCCGGTGGGAGAGCAGCAACAAGGCCCTGCTTGAGATGGCTGAGGAGGTGG - 15901 
-MYRSRWESSNKALLEMAEE



15902 - GCTGTCTGTGATCTGCAGCCAGGGTGGGGGTGTGCACTTAGCGCATATCAGGCCCTTTCC - 15961 

15962 - TGTATGTTCTACCCATCAGTGACACAGCTAGCATGAGGTAGAGGTGAGATTTGCACACAA - 16021 

16022 - TGTCCAAGTCCAAAGTTAATGCTGTTCTCTCCCCATGGGAGGTGGTGAGCCCAGTGGTAG - 16081 

16082 - GTCTCCAGTGGGAGTGAAGGGAGCAAATGGAAGAAAGGAATAAAAGAGCAGAAAAAAACG - 16141 

16142 - GGTGCCAGTGATGTGCCTGGTTTACATGTAAAGCAGCCCAGGTAGTTTGTGATTTCACAG - 16201 

16202 - CTTGTAATGTAGAAGAAAGGAACTAACGATGGAGCAGCAACTGCAAGCCAGACCTTGCTG - 16261 

16262 - AAAGTTTTTGGGTTTTTTTTGTCTTTTTTGCTGCTGAATGTTTTTAGGTACGTTGTTCAT - 16321 

16322 - TGAACCTTCTCTTGAGCTCTGAGGATGGTATTAGTAGTCCTGTTTTATAGATGAGACAGG - 16381 

16382 - CTCAAAAGTCAAGTCCTTTGCCAAGGTCACGTGGTAGATAAATGGAGGAATACGTTATCT - 16441 

16442 - CCAAGCCGTGCCCCTTTTCTGCACCATGCTGCCCCACCTGACAGCCTAGTCATGGCTTCA - 16501 

16502 - ACTAGGACTGTTTCCTAAAGGGGGCCAGCTTTGGACTCGGTCTGCTCTCAGCCTTGTTAA - 16561 

16562 - AGTGTTTGCCGCCAAGTGGTGATGGTAAGTGGGAGGiTGATGGGGCACGGCACTGAAGGT - 16621 



16622 - CTCATTTCTTTCCCTAGAAAACAGTCCGGGATAAAGAACTGGAGGGCCTGCAGGTAAAAA - 16681 

-KTVRDKELEGLQVKI

16682 - TCCAACGGCTGGAGAAGCTGTGCCGGGCACTGCAGACAGAGCGCAATGACCTGAACAAGA - 16741 
QRLEKLCRALQTERNDLNKR 

16742 - GGGTACAGGACCTGAGTGCTGGTGGCCAGGGCTCCCTCACTGACAGTGGCCCTGAGAGGA - 16801 

-VQDLSAGGQGSLTDSGPERR

16802 - GGCCAGAGGGGCCTGGGGCTCAAGCACCCAGCTCCCCCAGGGTCACAGAAGCGCCTTGCT - 16861 
PEGPGAQAPSSPRVTEAPCY 

16862 - ACCCAGGAGCACCGAGCACAGAAGCATCAGGCCAGACTGGGCCTCAAGAGCCCACCTCCG - 16921
PGAPSTEASGQTGPQEPTSA

16922 - CCAGGGCCTAGAGAGCCTGGTGTTGGGTCATGCTGGGAAGGGAGCGGCAGCCCAGCCAGG - 16981
R A ★

16982 - CCTGGCCCATAAAAGGCTCCCATGCTGAGCAGCCCATTGCTGAAGCCAGGATGTTCTGAC - 17041
17042 - CTGGCTGGCATCTGGCACTTGCAATTTTGGATTTTGTGGGTCAGTTTTACGTACATAGGG - 17101 
17102 - CATTTTGCAAGGCCTTGCAAATGCATTTATACCTGTAAGTGTACAGTGGGCTTGCATTGG - 17161
17162 - GGATGGGGGTGTGTACAGATGAAGTCAGTGGCTTGTCTGTGAGCTGAAGAGTCTTGAGAG - 17221 
17222 - GGGCTGTCATCTGTAGCTGCCATCACAGTGAGTTGGCAGAAGTGACTTGAGCATTTCTCT - 17281 
17282 - GTCTGATTTGAGGCTCAGACCCCTCCCTGCCCTTCAGAGCTCAAGACAAGTAATACACCC - 17341 
17342 - AGGTCTTGACTGCATTTGTCTTGTGAGCAGGGCTTGCTTGGTCAGCTCAGGCCCTCCTAG - 17401 
17402 - CTGCTCTGGAGGCTCCTTTGATTCTCTAGACCTGGAAAAGGTGTCCCTAGGCAGAGCCCT - 17461
17462 - GGCAGGGCGCTCAGAGCTGGGGATTTGCTGCCTGGAACAAGGGACCTGGAGAATGTTTTT - 17521 
17522 - GCGTGGGATGATGTGCTGGTCAGGAGCCCCTTGGGCATCGCTTCCCCTGCCCTTTGGTAG - 17581 
17582 - TGCCAGGACCAGGCCAATGATGCTTCTCAGTAGCCTTATCATTCACAGGTGCCTCTCTAG - 17641
17642 - CCTGCACAAATGATTGACAAGAGATCACCCAAAGGATTATTTCTGAAGGTGTTTTTTTCT - 17701
17702 - TTATTTCTTTTTCTTTTTTTTTTTTTTTCTTTTTCTTTTTTTTTTGCACATGACAGTGTT - 17761 
17762 - TGTATTGAGGACCTTCCAAGGAAGAGGGATGCTGTAGCAGTGGTGCCTGGGTGCCTGGCC - 17821 
17822 - TCCAGTGTCCCACCTCCTTCACCACCCCACTTGGCTCCTTTGCCATCTTGATGCTGAGGT - 17881 
17882 - TTCCTGTTTGGTGAGATCAGGTTGTTTGTGGTAAAAGAAAGGAAAGGGCTTCTGATGGCT - 17941
17942 - TTGCCACAAGCTTACCTGTGGGTTTCAGTCCTGAGAGGCCACCACCAGTTCCCATCAGCA - 18001
18002 - CTGTCTCCATGCAGCAGTTGCTGGGTCCCATGTCCAGCTGCCTCTTTGGCTTCATGGGTT - 18061
18062 - TTTCTGCTTCCTGCCCCCACCCCCACATGTGCAATCCTCAAGATTTGTCCTGATTCTATT - 18121 
18122 - TCCTGGCACCTCCCTGCCTGTCCTTGGGGATTCTACTTCTTCCTGTGTGGGAGCCCATAG - 18181
18182 - CTGTTGTCTAACAGGTAAGAAATGAAATTGAACTATTGACTGGGCCCCAGAAATCCATAA - 18241 
18242 - AATGGCTGCAGACAGTTGTTTCTGTGTCCTGTTCTACCCCCACTCCAGTACATAACTACT - 18301 
18302 - ATGTACTGTGTAGAGCCATTCTATATGCTGAATGTTCTGCTGTTGCAAACTTGCCAGGGT - 18361 
18362 - ATTAGCCAGTGTTTGTGCCAAGCAGTTTTCTGGGACAACAGAATGACTCAGACCAAGATG - 18421 
18422 - GATAGGATGGTTAGGGCTTTGCTTCTTGCTGTTTTTCTTTGAAGCTAGTTCATTGTCCTG - 18481 
18482 - CAGGTCCCTTCATCTTCCATACCTAGCCCACTCTTTTAGCCCTTACCTTAAATCTCTCAG - 18541 
18542 - ATAAGTTGGTTCACAAAGAATGTTAAGTACTGAATCATGTGTGACTGAGACCAGAGATGG - 18601 
18602 - CAAATGAATGGCACACCATTTCTCCTTCTCCTGCCCCAGGGCAGGTACCACTGATCTGCA - 18661 
18662 - TCAGAGTTGCCTGCTATTCTCTGGTGTATCCTTCACATCTAGGTGCCCTCAAGCAGCTGT - 18721 
18722 - GTGAGTGTTGAGATCTCTGCCATCTCTGGCTGAGATACTGCTGTCCTGTGAAGTGTTTCC - 18781 
18782 - CATGACCTTTTTCTTCCCCTTTGAATCCCTCTGTCTGGAGTAGTCCTTGCCTCTTCCTGC - 18841 
18842 - TCCAGTAGGGCCTTTTCCCTACCCCAGCCCCTGTGCCAGGCTAAGCTGGTACAAGAGCTG - 18901
18902 - CCAACCTCACAGAGTGTTTGCTAGGCGAGAGAGGTGCAGGGAAGAGGCAGAGGTATGCAC - 18961 
18962 - CTTCCCCCTTGAAGAGAGGGGAAAGGCCTACAGTGGCCCACATAATTGCCTGACTCACAC - 19021 
19022 - TTCAGCTACCTCTTAATGCCTGTGGAGGGACTGGAGCTGCTGGATCCCAGTGTGGTGGTG - 19081 
19082 - TAGGAGGCCACAGTGAGCAGGTGGCCCCAGCTGGGTTTCCCAGGTCAGGAATGTGGGCCC - 19141
19142 - CAGGCAAGGTGCAGCCTTTGCTCACAGCTCCATCCATGTCTAGACCTTCAGGCCAGTCTG - 19201 
19202 - CAGATGAGGTTCCCTACCTTTTTCTTCTCTTCATTGACCAAATCAACCAATCACTACAGC - 19261 
19262 - TGCTCTGCTTCTGCTTTCCAAAGTAGCCCAGGTCCTGGGCCAGATGCAGGGGAGGTGCCT - 19321 
19322 - ATCCATGAGTGAAGGCCAGTGTCTTCCTCACCTGGGTGGGTCCCACACTTGTGACCTCAG - 19381 
19382 - TTTTAGGACCAAGATCTGTGTTGGTTTCTTAGATTGCTAGCTTTTCCTCCAGGGGACCAC - 19441 
19442 - AGCAGGTGAAGCTCAAGAGCGCATGGCTCTGCTAATAGTAAATTGTTTTCAGGGCCTTGT - 19501 
19502 - CCAGCTGAGAGCTTCATGTCCACCAGATTCTGAGAGGTGTCAGCAGCACTTTTTTTTTTT - 19561 
19562 - ATTTGTTGTTTGTTTTCCATGAGGTTATCGGACCATGGGCTGAGCTCAGGCACTTTCTGT - 19621 
19622 - AGGAGACTGTTATTTCTGTAAAGATGGTTATTTAACCCTTCTCACCCCATCACGGTGGCC - 19681 
19682 - CTGAGGGCTGACCCGGAGGCCAGTGGAGCTGCCTGGTGTCCACGGGGGAGGGCCAAGGCC - 19741 
19742 - TGCTGAGCTGATTCTCCAGCTGCTGCCCCAGCCTTTCCGCCTTGCACAGCACAGAGGTGG - 19801 
19802 - TCACCCCAGGGACAGCCAGGCACCTGCTCCTCTTGCCCTTCCTGGGGGAAGGGAGCTGCC - 19861 
19862 - TTCTGTCCCTGTAACTGCTTTCCTTATGGCCCAGCCCGGCCACTCAGACTTGTTTGAAGC - 19921 
19922 - TGCACTGGCAGCTTTTTTGTCTCCTTTGGGTATTCACAACAGCCAGGGACTTGATTTTGA - 19981 
19982 - TGTATTTTAAACCACATTAAATAAAGAGTCTGTTGCCTTACTTGTTTCTCTCCTGACCTG - 20041 
20042 - TGTATTCCTTTGTTTCTGGATCTGATCCATTCAGCCCCTTCCATCATCACTGACTTGTTC - 20101 
20102 - AGGTCTGCTGCAGAGCGCCCATGGTGGTTCCCTGGTATCTTACATATTCCACAGTGTCTT - 20161 
20162 - TGAGCAGTCGCCACAGCCTCAGGATGCTGGCATATTCACTTGAGCTGCCTGAGTGGAGCC - 20221 
20222 - CTTGGCAAAGTTGGCAAGACCCTTGCCTCAGAGAGGATCACACACACACAAAAAAGTTTT - 20281
20282 - CCCTGACCTGGGGGCTCACAGGCTAGTGAAGGGAAAAGGTACTTTTAGCTATAGACAGGT - 20341

20342 - CAATGGTGCTGAGAGCAGAGAGGAGGCCCCTGCCCCCTTCAGCAAGGTGAGGGGGTGATA - 20401 

20402 - CCTGGAATGGCCTTCTGAACCACAGGGCAGGTAGAAGATGAACGTCATTTAGTGATTAAA - 20461

20462 - TGGTACAGCTGGGAAGCAGGTCCATGGGACTGGGAGAGGGGGTGAGGCTGGGCCCAGAGT - 20521

20522 - CTGGGTACCAGGTTAAGGAATGTGGGCTAGATCCAGAGGGCAGGGGGGGCAACTGAAGGT - 20581 

20582 - GTTTCAATAGGAAATTGATAGGCTCCAGCAGTAAGGCAAAAGGCATGGAGCCAGGCATAG - 20641 

20642 - GCCATTTGAGGCCCAGGTTAAGAGGGGTGGACACTCATCACTGCTATTTGGGTCTGAGCT - 20701 

20702 - GTGGGTAGGCTCCTATAGCCCTGGCCTGCCCAAGGGAATTCACAGGGGCCTCTAATTGTA - 20761

20762 - TGCATTCCTTAAGGAGAGCACATTCTCTGTTCAGTTTTTACACCCCCCATTTACCCACCT - 20821 

20822 - CAAGCATGGGACTCCTATATGGGAGACATGCTGCTGGTGGCCTCACCCAGCACCCTGTTC - 20881 

20882 - TCTCTGGGTCCTGGGTTGGTCAGGCACAAAGGATGATATGTGCTGAATGCCCAGGAAATG - 20941 

20942 - GCAGAGACAACCCACCTGCCCTTCCCTCCAGGCCTCCACAAATAGATGTGCCCACAATGA - 21001 

21002 - CTGTGACAGTCCCAGCAGAGCCTCTGACCCTTCTAGCTGGGTCCTGATACATGTTTTCCA - 21061

21062 - TGCTGGCCATGTTATTTCTAGTCGCAGATCCTCTGGAGGGTGTGGGGGGGGTGCCGCCCC - 21121 

21122 - AACTCTTGGAGATTCCAAGCAAAGCAGCTCTGAGAATAATGAGGTTTCTGACCCCCCAGT - 21181 

21182 - GAAGCAGCTGAGGATGGGAACCACAGGGGTGCTCCCTCTGTCAGCAGCATTACCACTGTC - 21241

21242 - TACTCTAGCAGCTCCGGTGGGGAAGGAGAGGGATTTCTGTTGTCCCCAGTCTGGGCCCCT - 21301

21302 - GGTTATTGAAAAAGTTCGGAATTACTCTTTACCCTTGTGGAGTGTTCTGAGTGTTGGAAG - 21361 

21362 - TACCCAGGAAGAAGCCCTGAGCAGGTGCCCTCAGGAGCAGTGCCCATGGCTCCCCACATC - 21421

21422 - AGCCAAGAGGCCCAACCCCAGGAAGCCACTCCTGCCCGGGGATGGGGAAGGTGGGCTGGG - 21481

21482 - TGGCTGTGTGCACTGCCCTGGGCCAGCTCACTTGAGCCTGCTGAGCCGCCTGGCCAAACA - 21541

21542 - TGAGCCTCTCTCCTGTTGTATCAGATGCTGTTCTGGGGACCTGCGCCAGGAGCCTCTGCC - 21601

21602 - AGGGCTTTAAATAGCTGCCCCCATTGATCTGGCTGCAGGCAGCAGCAGTCACACTGGGTC - 21661 

21662 - AGCCTCCATCAGGTGCTCAGGTTTCCCTGAGGACTGGAGTCAGGTGCCAGGGAATCGCGT - 21721 

21722 - GGTCTACCTTATGACCTGGTGCTCCCCACACCTGTCTCCTAGGCCTGGGGGGTGGGGAGG - 21781 

21782 - ACTCCTGTCACTTCATCTGCGGCAAAATACAGCCCCCACCACTTACCAGAGAAAACTGTC - 21841

21842 - TGGCATTGTAGAGAGAGGGGTTTTGCCCTCAAAAGACTGTTGCTTACTTTCAGTAGAATG - 21901
21902 - GGGAATGACACTGGTATCTTCCTTAAGGGTTGTTATGGGGATGAAATGTATGTAAAGTGC - 21961
21962 - TCAATAGGGCACTGGACTCACTCCATTGATGGCTGTCTTTGCTCGAAGTGTCTTCCTGAT - 22021 
22022 - GCTGCTGCTGTTGCTGCTTGTGCTTCTTCTGTGCTTACATTCTCTCTCTCTCACTCACTC - 22081 
22082 - ACTCTGTCTCTCCTCTCCCCCGCCCCACCCCCTTTCTGACAAAGCCACCACCATTTTGTA - 22141 
22142 - AGGAACTGTAGCTTCTCTCTGAAACTGCCGGGAAAGGGAAAATCTTTTTAAAATAGACAT - 22201 
22202 - CACACAACCAACAGGGTCCCCTAGGTTCAGGCGGGGAGGTGAGGTCGAGTGAGA - 22255 




